SOME APPLICATIONS OF THE RAYLEIGH-RITZ 
METHOD TO THE THEORY OF THE 
STRUCTURE OF MATTER! 


HUBERT M. JAMES 


I have been asked to discuss for you some applications of the Ray- 
leigh-Ritz method in physics. Rather than attempt to survey the 
whole of a discouragingly large field I have preferred to restrict my 
attention to some few topics from the theory of atomic and molecular 
structure. The application of the Rayleigh-Ritz method in this field 
has been carried out with some care and persistence, and there are 
enough results available to make possible a critical discussion of the 
potentialities and limitations of the method. This field also presents 
good examples of how one’s mathematical procedure is influenced by 
one’s physical notions concerning the system to be studied, and, in 
turn, what sort of physical conclusions may be drawn from results 
yielded by the method. 

First of all I shall indicate why recourse to the Ritz? method is 
necessary if one is to obtain a satisfactorily accurate treatment of 
some of the principal problems in atomic and molecular theory. I shall 
indicate, though not in as much detail as I should like, the nature of 
the considerations by which one chooses the set of coordinate func- 
tions employed in this method, and show how rapid a convergence 
one can hope for in the approach to the solution. After presentation 
of some of the very satisfactory results one can obtain, by the ap- 
plication of enough hard work, in not too unfavorable cases, I shall 
finally have to indicate the increasingly serious limitations of the 
method as one seeks to apply it to the treatment of more and more 
complicated physical systems. 

The fundamental mathematical problem in the study of an atomic 
system by wave mechanics is usually the solution of the stationary 
state wave equation for that system. In the non-relativistic case this 
is a linear partial differential equation of the second order, which can 
be symbolized thus: 


Hy = Ey. 


Here H represents the Hamiltonian operator for the system obtained 


1 An address delivered before the meeting of the Society in Washington, D. C., 
on May 3, 1941, by invitation of the Program Committee. 
2 W. Ritz, Journal fiir die reine und angewandte Mathematik, vol. 135 (1909), p. 1. 
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from the Hamiltonian function H of classical mechanics by the opera- 
tor substitution 


h @ 
0g: 
where h is Planck’s constant. For instance, for a hydrogen atom with 


fixed nucleus H is the sum of the kinetic energy of the one electron 
and the potential energy of the electron with respect to the nucleus: 


e? 


1 
H=—(p.+ P+ 
2m r 


The Hamiltonian operator correspondingly involves the Laplacian 
operator in a term representing the kinetic energy of the electron 
and a scalar multiplier representing the potential energy: 


h2 e 
The wave equation is then 
2 
Vy ——v= Ey. 
822m ¥ r 


In these equations E is a number, the energy of the system, and yf, 
the corresponding solution, is the wave function which describes the 
state of the system when it has that energy. y is obviously a function 
of all the spatial coordinates needed to define a configuration of the 
system. In the case of a simple molecule like Cle, for instance, Y would 
depend on 108 variables. It is this multiplicity of variables which 
makes it so hard for physicists to interest mathematicians in some 
problems they would like very much to have solved. 

Now not all the solutions of the wave equation are of interest, but 
only those which can be physically interpreted. This requirement of 
physical interpretability places certain auxiliary conditions on the 
functions ¥, often merely that they be quadratically integrable or 
that they remain bounded. Between the physically interpretable wave 
functions and the stationary states in which the system may be found 
there is a one-to-one correspondence. Thus in many cases physically 
satisfactory solutions of the wave equation exist only for certain dis- 
crete values of E, and in such cases one finds that the physical system 
also can possess only these discrete energies. 

The physically satisfactory solutions y of such an equation and the 
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corresponding energies E form the raw material for any detailed 
quantitative treatment of the atomic system. For instance, a mole- 
cule can emit radiation when its energy changes from one of the dis- 
crete possible values to another. One can compute the average 
intensity of such radiation as soon as one knows the yw functions 
for the initial and final stationary states, while the wavelength can 
be determined at once from the corresponding E’s. Thus by deter- 
mining all the physically acceptable y’s and E’s which satisfy the 
wave equation for a molecule one could make a complete prediction 
of the wavelengths and intensities in the spectru:n emitted by a gas 
consisting of these molecules, under any specified conditions. These 
wave functions are similarly fundamental in the study of collisions 
and other interactions between atoms or molecules, the absorption 
and scattering of radiation, and so on. The difficulty is to get reason- 
ably accurate y functions on which to base further work. 

In the simplest case of the hydrogen atom a complete solution of 
the problem is possible. The wave equation is separable in spherical 
coordinates, and one has to deal only with ordinary differential equa- 
tions which offer no special difficulty. However, when one passes to 
the next more complicated problem, the helium atom with a heavy 
nucleus and two electrons, one meets most of the difficulties charac- 
teristic of this work. 

If we treat the nucleus of the atom as fixed, the wave equation in- 
cludes terms representing the kinetic energies of the electrons, the 
potential energies of the first and second electrons with respect to the 
nucleus, and their potential energy with respect to each other: 


h? 2 2 2e? 2e? e 
(i+ Wn, +(-——- = 4+ 
822m ry 
= Ey(n, f2). 

It involves six independent variables or two vectors 1), f2, defining the 
positions of the two electrons; one might, for instance, take as three 
of these variables the separations 7", 72, 712. of the three particles, on 
which the potential energy terms depend, and as the other three vari- 
ables the angles necessary to define the orientation in space of the 
triangle formed by the three particles. 

The fundamental difficulty of this equation arises from the fact 
that it is not completely separable. One can separate out the angle 
variables just mentioned, but there remains a partial differential 
equation in three variables which is not separable in any set of co- 
ordinates. In the very simplest case, in which y does not depend on 
the angle variables at all, the separated equation becomes 
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This equation is not only complicated, but unpleasantly singular, the 
potential energy becoming infinite as any one of the three variables 
approaches zero. Though it does not seem outside the range of possi- 
bility, no one has yet been able to set up a direct method for solving 
this equation or even for proving the existence of solutions. One must 
immediately resort to approximations. 

One method by which physicists often deal with such equations is 
that of forcing a separation of variables by neglect of judiciously 
chosen small terms, the resulting ordinary differential equations then 
being treated by the customary methods. This and the closely related 
methods of Hartree* and Fock‘ actually serve to give useful approxi- 
mate solutions of atomic wave equations involving several hundreds 
of variables. Peculiarly enough, the limitations of these methods are 
in practice apt to be more serious the lighter, the less complicated, 
the atom. 

In the case of the state of helium with lowest energy, the ground 
state, computations by this method become quite simple. If one 
makes the very crude approximation of neglecting the mutual re- 
pulsion of the electrons, as represented by the term e?/r, the ap- 
proximate wave function becomes 


y = | 


the unit of length being the atomic unit ay=5.28X10-*° cm. The 
atomic energy computed with this function is — 76.43 electron volts, 
against an observed value of —80.62 electron volts. Neglect of a 


group of terms, 
# 
32 Lal T2 


2D. R. Hartree, Proceedings of the Cambridge Philosophical Society, vol. 24 
(1928), p. 189; J. C. Slater, Physical Review, vol. 32 (1928), p. 339. Results published 
in many papers of Hartree, mostly in Proceedings of the Royal Society of London, 


series A. 
4 V. Fock, Zeitschrift fiir Physik, vol. 61 (1930), p. 126. 
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which is on the average smaller than the single term e?/ri2, leads to 
a better wave function, y= Ce—@7/19("1+"2), and a computed atomic 
energy in error by 1.53 electron volts. Further improvements are not 
easily obtained, and by pressing this method to its limit one can re- 
duce the error in the computed energy only to 0.81 electron volts. 
To obtain wave functions and energy values more accurate than 
these, as one very much wants to do, one must resort to the varia- 
tional formulation of the problem. One then seeks to determine the 
wave functions, not as solutions of a differential equation, but as the 
properly continuous functions which extremalize the average energy 


Y*Hydr 


the corresponding extremal values of this quantity are then the sta- 
tionary state energies of the system, which one wants to determine. 

A variety of variational procedures have been applied in the theory 
of atomic structure, particularly in the study of the lowest energy 
states of the systems. The correct wave function for such a state 
makes the computed energy an absolute minimum. An approximate 
wave function will then presumably be better the lower—that is, the 
more nearly correct—the corresponding energy. It is accordingly a di- 
rect, and very common, procedure to set up approximate wave func- 
tions known to have a generally appropriate form, but involving 
adjustable parameters a, 8, - - - . Adjusting these parameters to mini- 
mize the average energy, one obtains the best approximate wave func- 
tion of the assumed form, and possesses in the minimum value of E 
the most favorable upper limit on the ground state energy of the 
system. 

For instance, one might assume a function of the form given by 
a forced separation of variables, but treat the constant in the expo- 
nential, which is the effective charge of the nucleus, as a variable 
parameter: 


? 


W(a; f2) = 


The best value of the parameter a is 27/16, the energy error being 
then 1.53 electron volts. Or one may set up the more elaborate and 
more flexible function, 


The best values of the parameters are then a=2.15, 8=1.19, and the 
error in the computed energy is only 0.76 electron volts, less than that 
given by any forced separation of variables. 
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Such schemes for the variation of parameters are often useful, but 
they have one serious limitation. In general an excessive amount of 
labor is required to determine the best values of the parameters, par- 
ticularly if these occur in exponentials. As a result, the number of 
parameters which can be handled, and thus the flexibility of the trial 
functions, is seriously limited. This is one of the points at which the 
Rayleigh-Ritz method is superior to all others. Without excessive la- 
bor one can deal with Ritz functions containing as many as twenty 
parameters, whereas with other schemes one is in practice limited to 
two or three. The result is that the Ritz method is quite without a 
rival in the accurate computation of wave functions and atomic ener- 
gies—as distinguished from roughly approximate computations. 

The critical step in the application of the Ritz method is the choice 
of the set of coordinate functions in terms of which the wave functions 
are to be expressed as sums. Here one must balance against the need 
for reasonable ease in the computations the requirement of a general 
suitability in the character of the functions chosen, for even the con- 
siderable flexibility of the Ritz function in practice is not sufficient to 
compensate an inappropriate choice of the form of the component 
terms. One must bring to bear on this problem all the physical consid- 
erations which may be available. In the case of the ground state of 
helium the relatively simple arguments would run as follows: 

First of all, since the electrons are identical the Hamiltonian opera- 
tor is symmetrical to interchange of their coordinates. From this it 
follows easily that every He wave function should be either sym- 
metric or antisymmetric to interchange of the coordinates of the elec- 
trons. States of the atom described by symmetric functions are those 
which the spectroscopist would class as singlet states, while the others 
are triplet states. The ground state of helium is known to be a singlet 
state; hence in attempting to determine its wave function one need 
consider only functions symmetric to interchange of the electrons. Simi- 
lar general arguments,5 combined with the spectroscopic observation 
that the ground state of helium is an S state, show that this wave 
function does not depend at all on the angle variables, but depends 
only on the distances between the particles. 

There are other arguments of a less precise character. For instance, 
if either electron of the atom is very far away from the nucleus the 
chance is overwhelming that the second electron will be, relatively, 
very near to it. The electric field at the first electron would then be 
nearly the same as if the second electron and the nucleus were super- 


5G. Breit, Physical Review, vol. 35 (1930), p. 569. 
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imposed—that is, almost Coulomb in character. Now we know that 
if the field were exactly Coulomb the wave function would decrease 
exponentially at large distances. Hence we can conclude that the situ- 
ation will be similar in helium; the wave function should decrease ap- 
proximately exponentially as the distance of either electron from the 
nucleus becomes very great. 

This fact would be indicated by the approximate wave functions 
given by any of the methods previously discussed. But there is an- 
other important feature of the correct function which none of those 
approximate functions possesses; not one of them depends on the dis- 
tance between the electrons. This is physically quite unreasonable, 
for it implies that the probability of any configuration of the system 
depends only on the electron-to-nucleus distances, and is just the 
same whether the electrons are very close together, on the same side 
of the nucleus, or far apart, on opposite sides of it. Obviously the 
mutual repulsion of the electrons will tend to keep them away from 
each other; correspondingly the wave function must tend to decrease 
in magnitude with decreasing separation of the electrons. It is the inabil- 
ity of most approximate wave functions to take account of this fea- 
ture in the behavior of the system that is responsible for their largest 
errors. 

How then shall one choose the terms from which the Ritz function 
is to be constructed? Formally, the simplest choice would be to make 
the Ritz function a power series, with the typical term, 

mn p 
11 ToT 12. 
This is impossible, however, for no finite sum of such terms would 
show the desired exponential decrease for very large 7; and 172; the 
very integrals in terms of which the variational procedure is stated 
would not converge. But one can obtain the desired asymptotic char- 
acter of the function, and assure the existence of all relevant integrals, 
by introducing an exponential factor into each term: 
—5(ritre) mn p 
é 71 T2712. 
Introduction of an exponential factor in rz might seem appropriate, 
but it would be quite unnecessary, and would only make the computa- 
tions more tedious. A sum of such terms, in addition to the linear 
parameters characteristic of the Ritz method, would contain an expo- 
nential parameter for which it would be much more difficult to deter- 
mine the very best value. Fortunately in practice one need worry 
little about the exact value assigned to such parameters; they may 
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be fixed once for all by an exploratory computation with a very few 
terms, or assigned a generally appropriate constant value on the basis 
of physical considerations of effective nuclear charges. 

A finite sum of such terms can have all the properties one desires 
in respect to flexibility, asymptotic behavior and simplicity, but may 
lack the proper symmetry to interchange of the coordinates of the 
electrons. This symmetry one can obtain in various ways. For in- 
stance, one may construct the terms from powers of 71-+72 and 1—f2: 


(ry + re)" (r1 ries 


and assure the desired symmetry by including terms with all powers 
of (r1+72), but only even powers of r1—1r2. These were just the coordi- 
nate functions used by Hylleraas in his pioneer work on the theory 
of the helium atoin.® 

In his first work Hylleraas obtained a quite simple six-term func- 
tion, 


y=e 


é 


—1.818(r3+1r2) { 


Co + + Corie + + re) 
ca(ri 12)? + = r2)}. 


In the atomic units Ry.k the energy computed with this function is 
— 5.80633, against the observed value of — 5.80736 + .00012; the error 
of .0139 electron volts is about one-sixtieth that of the best previous 
computation. In later work’ with an eight-term function the com- 
puted energy was depressed to —5.80748. It might seem embarrass- 
ing to the theory that the computed upper limit on the energy should 
come below the observed energy. However, the wave equation treated 
by Hylleraas involved certain approximations, neglecting relativistic 
effects and in part, the motion of the nucleus. Adding a correction 
of +0.00010 for the effect of nuclear motion and one of +0.00018 for 
relativistic effects* one obtains a theoretical value of — 5.80720 for the 
ground state energy of helium. This agrees with the observed value 
to one part in 30,000, practically within the accuracy of observation, 
and, as was to be expected, lies slightly too high. 

This very satisfactory treatment of the three-body problem in wave 
mechanics is of historical interest in that it gave the first quantita- 
tive check on the present theory of atomic structure as applied to 
systems more complicated than the hydrogen atom. Many theories 
have been devised to account for the very simple properties of hydro- 


6 E. A. Hylleraas, Zeitschrift fiir Physik, vol. 54 (1929), p. 347. 
7E. A. Hylleraas, Zeitschrift fiir Physik, vol. 65 (1930), p. 209. 
8 H. Bethe, Handbuch der Physik, XXIV/1. See pages 371, 383. 
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gen, but no other one of them has been able to stand the test of exten- 
sion to the treatment of helium. The confidence of the physicist in the 
essential correctness of the present atomic theory is of course based 
on a wide variety of comparisons with experiment. A number of other 
equally impressive examples of this particular quantitative type have 
been obtained through application of the Ritz method, and will be 
presented later. 

By making use of the observed energies of the higher stationary 
states of helium one can fix an upper limit on the error of these wave 
functions.* One can show, for instance, that the root-mean-square 
error in the six-term function does not exceed 3 per cent; a reasonable 
estimate is 1 per cent. For the more complicated function the error is 
of the order of 1/5 per cent. Such accuracy is more than adequate 
for the purposes of the physicist. 

Similar results can be obtained in studies of the lower excited states 
of helium. These states differ from the ground state in one important 
physical respect. In the ground state the two electrons are said to be 
equivalent; the most probable configurations of the atom are those 
in which the two electrons are at about the same distance from the 
nucleus. In the lower excited states of the atom just one of the elec- 
trons is excited; the most probable configurations are those in which 
one electron is much farther from the nucleus than the other. The 
effective nuclear charges are then different for the two electrons, that 
of the outer electron being reduced by its repulsion by the inner elec- 
tron. This one should take into account in constructing the Ritz func- 
tion, replacing 

—8(ritr2) mn 


P 
11 T2712 


by 


—berg mn 
é 71 71712. 
The terms in the Ritz function are then to be constructed by making 
appropriate linear combinations of these functions, symmetric in the 
case of 1S terms, and antisymmetric in the case of *S terms. 

If the excited state to be considered is not the lowest of its sym- 
metry one does not obtain the energy of the state as the lowest root 
of a secular equation, but as the second lowest root, third lowest root, 
and so on. Fortunately one can show that an approximate energy thus 
computed is always an upper limit for the energy of the corresponding 
state of the atom, approaching the correct value from above as the 


°H. M. James and A. S. Coolidge, Physical Review, vol. 51 (1937), p. 860. 
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flexibility and accuracy of the Ritz function are increased.!*-!! Conse- 
quently the error in the computed energy provides an indication 
(which can be made quantitative) of the accuracy of the correspond- 
ing function. This is an outstanding advantage of the Ritz method 
over all other practical methods of treating excited states, in which 
an indifferent approximate wave function can by chance lead to the 
correct value of the energy. 

Among the results of computations on excited states of helium are 
the following: The lowest excited state of helium is a *S state. The 
observed energy is —4.3505 in the atomic units Ryz,h while Hylleraas 
and Undheim,"® using a six-term function, have computed an energy 
of —4.3504, an error of about one part in 40,000. The second excited 
state is a 1S state, with an observed energy of —4.2920, while Coolidge 
and James'* have used an eleven-term function to compute the value 
—4.2916, in error by one part in 10,000. The greater complexity of 
the function needed in the case of the more highly excited state will 
illustrate a general feature of computations with the Ritz method: 
convergence on the correct function becomes slower the more highly 
excited is the state considered. This factor, together with the increas- 
ing usefulness of certain approximate methods, limits the interest in 
extending Ritz computations to very highly excited states. 

Another simple and important problem is that of the hydrogen 
molecule, Hz. This provides illustrations of the results obtainable 
with the Ritz method in rather different and more complicated cir- 
cumstances, and also material for some general remarks about the 
method, which can here be based on more extensive studies than in 
the case of helium. 

The problem of treating He, a system containing two heavy nuclei 
and two light electrons, can be reduced to that of solving the wave 
equation for the motion of the electrons in the field of the nuclei, con- 
sidered as fixed. The wave equation is then 


ht 2 2 e? e? e? e” e? 
2 


= E(R)¥(R; £2). 
This partial differential equation involves the six coordinates of the 


two electrons as independent variables, and the fixed separation of 
the nuclei, R, as a parameter. It contains terms corresponding to the 


10 E. A. Hylleraas and B. Undheim, Zeitschrift fiir Physik, vol. 65 (1930), p. 759. 
11 J. K. L. MacDonald, Physical Review, vol. 43 (1933), p. 830. 
2 A. S. Coolidge and H. M. James, Physical Review, vol. 49 (1936), p. 676. 
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kinetic energies of the first and second electrons, the potential energy 
of the first electron at a distance 1; from the first nucleus and s; from 
the second nucleus, the potential energy of the second electron in the 
field of the nuclei, the potential energy of repulsion of the two elec- 
trons, and the potential energy of repulsion of the two nuclei. Again 
the difficulty in solving the wave equation arises from the non-separa- 
bility of the variables. In this case at most one variable can be sepa- 
rated out, and every exact solution depends on at least five variables 
—for instance, 71, fe, $1, $2, 

The physical considerations on which one can base approximate 
solutions of this equation are interesting but so extensive and compli- 
cated that I cannot undertake to present them here. I can suggest 
only one of several possible lines of thought. If, for instance, one neg- 
lects the repulsion of the electrons the problem becomes completely 
separable. To this approximation, each electron moves as it would in 
the absence of the second electron—that is, as if it were the one elec- 
tron of an ionized hydrogen molecule. Correspondingly the wave 
function becomes a product of two functions, each a solution of the 
wave equation for H+. 


ri, = xi(R; £1)x2(R; 42), 


h? 2 e? e? 


$1 


Now this latter equation is separable in elliptic coordinates 
M=(n+s1)/R, $1=azimuth about the internuclear 
axis, and the form of the solutions can be determined in detail. In 
the cases to be mentioned later the interesting solutions are independ- 
ent of ¢ and can be well approximated by a sum of a few terms of the 
form e~”\"u/. This suggests that one might well form the terms of a 
Ritz function for hydrogen as products of such factors, one for each 
electron, multiplied by various powers of the interelectronic distance 
to take account of the tendency of the wave function to decrease as 
the electrons approach each other: e~?1*"\Tjulubr?,. The symmetry 
or antisymmetry required of the function according as one treats sin- 
glet or triplet states of the system can be assured with only slight com- 
plication of the form of the terms: + MAT 
Coolidge and I"* have used just such terms in treating the ground 
state of Hz. For application to excited states of the molecule only 
one slight modification is necessary. Since the electrons are no longer 


13 H. M. James and A. S. Coolidge, Journal of Chemical Physics, vol. 1 (1933), p. 
825; ibid., vol. 3 (1935), p. 129. 
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equivalent, one must introduce two different exponential parameters 
for the two electrons, corresponding to the different effective nuclear 
charges. 

In working with these functions we have made extensive studies of 
the way in which the computed energy converges on the correct en- 
ergy as more and more terms are added to the Ritz function. The con- 
vergence behavior of course varies with the chosen form of the terms, 
but when a physically appropriate choice has been made the conver- 
gence follows a remarkably similar pattern, whether one is working 
with some state of the hydrogen molecule, or the helium or lithium 
atoms, or the lithium molecule Liz, or any other system with which 
we have had any experience. 

In a favorable case, of which the ground state of Hz is typical, the 
convergence is very simple and regular. A presentable approximate 
function and energy value can be obtained with relatively few terms 
—-say five or six. Thereafter one finds that the small improvements 
obtained on adding further terms to the function are nearly additive. 
Thus a term which produces an improvement of .01 volt in the com- 
puted energy when it is added to a five-term function will give nearly 
the same improvement when added, separately or together with other 
terms, to a ten-term function. Deviations from strict additivity of im- 
provements are almost invariably in the direction of a reduction in 
the importance of a term when it is added to a better function. Thus 
it is possible to put an upper limit on the error arising from neglect of 
any given term by trying it out together with a small number of 
others. This greatly reduces the labor of eliminating useless terms 
from consideration and getting the simplest possible Ritz function of 
a given accuracy. 

Another important feature of the convergence is the rapid and regu- 
lar falling off in importance of the terms in a sequence in which the 
value of any single exponent is increased. Thus in one case the term 
with exponents (0, 0, 1, 0, 0) was one of the most important terms, 
(0, 1, 1, 0, 0) gave an improvement of .008 electron volts, and 
(0, 2, 1, 0, 0) an improvement of .0015 electron volts. This rapid con- 
vergence makes it relatively easy to estimate closely the improvement 
in energy to be obtained on adding to the Ritz function all the terms 
of a given sequence. The near independence of the improvements due 
to the different sequences then makes possible a useful estimate of the 
convergence limit as all terms of the complete set are added to the 
Ritz function. For example, we have obtained the following results in 
treating the ground state of He. A five-term function gave the energy 
of the molecule as —2.3329 atomic units, against the observed value 


1941] RAYLEIGH-RITZ METHOD 881 


of —2.3493—a result already far more accurate than has been ob- 
tained by any method other than the Ritz method. A thirteen-term 
function gave the energy as —2.3470, correct to 1/10 per cent; the 
root-mean-square error in that function itself is of the order of 1 per 
cent. Finally, a careful convergence estimate led to the energy value 
— 2.3492 + .0008. When this value was published it was considerably 
more reliable than any available experimental result, but a later very 
accurate determination by Beutler and Jiinger!* gave the experi- 
mental value quoted above, in complete agreement with the apparent 
convergence limit of the Ritz method. 

In treating some of the excited states of Hz we have encountered 
a less simple convergence behavior.!5 Sometimes it happens that two 
terms will cooperate; that is, they will together improve the computed 
energy by more than the sum of the improvements they would pro- 
duce separately. We have not encountered any case in which three or 
more terms have formed a cooperating group. When such cooperation 
occurs it is usually easy to understand in physical terms. In practice 
in treating a given problem one quickly finds out whether such co- 
operation is likely to occur and can predict fairly accurately what 
pairs of terms will show it. Somewhat more care is required in dis- 
carding terms and making convergence estimates, but the difficulty is 
not serious. 

Somewhat more annoying is the general tendency toward slower 
convergence in the case of the excited states. With the ground state 
five terms suffice to give a very useful function, while with an excited 
state ten terms may be necessary to obtain similar accuracy. This in- 
creasing slowness of convergence is most marked when one deals with 
the higher states of a given symmetry. Thus our best function for the 
ground state, a !, state, contains thirteen terms. To get a somewhat 
less accurate result for the first excited state of this character sixteen 
terms were needed, and we have never succeeded in getting a usable 
approximation for the second excited 12, state, which, incidentally, 
would be of particular physical interest. 

Despite these difficulties the Ritz method has yielded very useful 
results for several excited states of H2.!5 The maximum accuracy for 
practical work was obtained with functions containing about sixteen 
terms, after many other terms were tried and discarded as negligible. 
These functions are accurate to the order of 2 per cent, the computed 
energies to about 0.1 per cent. 


14 H. Beutler and H.-O. Jiinger, Zeitschrift fiir Physik, vol. 101 (1936), p. 304. 
1H. M. James, A. S. Coolidge and R. D. Present, Journal of Chemical Physics, 
vol. 4 (1936), p. 187; A. S. Coolidge and H. M. James, ibid., vol. 6 (1938), p. 730. 
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The physical significance of such applications of the Ritz method 
may be illustrated by the results of a computation in which use was 
made of these wave functions.** When H: is bombarded with electrons 
having more than 12 electron volts of energy it emits a strong con- 
tinuous spectrum in the ultraviolet. This arises when molecules ex- 
cited to the stable 1sa2so *2, state pass over into the lower and 
unstable 1so2pa *2, state. If the energy of the bombarding electrons 
is carefully kept to a minimum or if enough helium is present in the 
discharge this spectrum takes cn a particularly simple and reproduc- 
ible character, for the vibrations of the molecule in the initial elec- 
tronic state are suppressed. Using Ritz wave functions for both 
electronic states of the molecule we have made a complete theoreti- 
cal prediction of the intensities in this spectrum. The results appear 
in Figure 1, together with the observational results of Smith, the best 
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available at the time. The agreement was not to be regarded as satis- 
factory. Since we had complete confidence in the theory we were 
forced to the conclusion that the experiments were in error, and that 
the intensity standards used by Smith were not satisfactory for work 
in this very difficult region of the spectrum. This has now been proved 
to be correct by the preliminary results of some very careful experi- 
mental work by Dr. Coolidge, also shown in Figure 1. On employing 
improved intensity standards, he finds that the gross disagreement of 
theory with experiment disappears. It is true that there remain dis- 
crepancies which are larger than one would like. This seems to be due 


16H. M. James and A. S. Coolidge, Physical Review, vol. 55 (1939), p. 184. 


+ 
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to difficulties with the intensity standard, which is still less reliable 
than the theoretical computations. It would not be at all unreason- 
able to work backward, from the theory and the observations of 
Coolidge, to an improved calibration of the standard source. 

Now a somewhat less satisfactory aspect of this subject must be 
mentioned. While the helium atom and the hydrogen molecule are of 
rather special interest to physicists they are also almost the simplest 
systems of their respective types, much too simple to be entirely 
typical. Unfortunately one soon finds on attempting to extend the 
Ritz method to the treatment of other atomic systems that the diffi- 
culties are numerous and serious. To indicate their character I need 
only to pass to the case of lithium. 

The lithium atom consists of a nucleus and three electrons. In the 
simplest possible case the wave functions depend on six essential and 
non-separable variables—say the six distances between pairs of par- 
ticles. The first discouraging effect of this greater number of variables 
is the corresponding increase in the number of potentially important 
Ritz terms with all exponents small. In passing from the hydrogen 
molecule with five essential variables to the lithium atom with six 
there is actually an increase from eight to thirty-two of the Ritz terms 
in which no exponent exceeds 1. The number of sequences of terms to 
be investigated in making convergence estimates is correspondingly 
increased. 

Each term is also more difficult to work with. The increased num- 
ber of electrons makes the symmetry conditions more complicated,” 
so that each Ritz term is, at the simplest, the sum of six parts, in- 
stead of the two parts needed with helium or hydrogen. There are 
thus more integrals to be evaluated in setting up the secular equation 
for the problem, and they are more difficult to evaluate because of 
the greater number and the awkward character of the variables of 
integration. 

In the case of lithium it is possible to overcome these difficulties. 
The observed total energy of the lithium atom is —14.9578 atomic 
units. Using a 17-term function we have computed the energy 
— 14.9566, in error by one part in 10,000. Of more physical interest, 
perhaps, is the energy needed to ionize the atom. Here the experi- 
mental result is 5.364 e.v., while our convergence estimate leads to 
the theoretical value 5.363 +.007 e.v. 

It does not seem useful, however, to apply the Ritz method in ac- 
curate treatments of atoms more complicated than lithium. The diffi- 


17H. M. James and A. S. Coolidge, Physical Review, vol. 55 (1939), p. 873. 
18 H. M. James and A. S. Coolidge, Physical Review, vol. 49 (1936), p. 688. 
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culties increase enormously and dampen effectively any interest one 
may have in this method of studying the systems. In the case of mole- 
cules the difficulties are even greater; to give an accurate treatment 
of the molecule Liz is much more than twice as difficult as to give a 
corresponding treatment of the lithium atom. It is true that the Ritz 
method has been fruitfully applied to the treatment of LiH,!* and of 
Lie,2° and especially of the ionized molecules!® LiH*+ and Li#,?! but 
here again the limits to the extension of the method seem to have been 
reached. 

In conclusion I should emphasize that this particular field of atomic 
physics has been discussed, not as a summary of all that has been 
accomplished in physics using the Rayleigh-Ritz method, but rather 
as an illustration of how the method is likely to work out in practice 
where it is applied. Applications have been made in widely varied 
fields—the theory of nuclear structure, acoustics, the electromagnetic 
theory of cavity resonators—to mention three. The field I have dis- 
cussed is typical in many respects. One can expect in general that the 
Ritz method can yield extremely accurate results with some problems 
not tractable to other methods of computation, and that its applica- 
tion will become impracticable just in cases where very many inde- 
pendent variables must be considered, or boundary or other conditions 
are especially complicated. What is abnormal about this field is the 
relative completeness with which the Ritz method has been exploited 
in it. This is due just to the scarcity of atomic problems in which one 
can deal with a reasonably small number of independent variables. In 
the majority of fields one can expect to find an unending sequence of 
problems to which the method can be usefully applied. 


PuRDUE UNIVERSITY 


19 J. K. Knipp, Journal of Chemical Physics, vol. 4 (1936), p. 300. 
20H. M. James, Journal of Chemical Physics, vol. 2 (1934), p. 794. 
21H. M. James, Journal of Chemical Physics, vol. 3 (1935), p. 9. 


SOME APPLICATIONS OF CERTAIN POLYNOMIAL CLASSES! 
I. M. SHEFFER 


The term applications will be construed broadly enough as to in- 
clude properties. It is here proposed, then, to examine some properties 
and some applications of some classes of polynomial sets. 


1. Formal properties of Appell sets. The power series approach to 
the theory of analytic functions presents us with a very simple poly- 
nomial set.” Thus, if f(x) is analytic about x =a, we have the expan- 
sion 

= (x — a)” 


(1) Xa 


n=0 


= f(a), 
n! 
in terms of the set 


{ (x a)"/n!}. 


This set has the important property of reproducing itself under the 
operation of differentiation, in accordance with the rule. 


d 
(2) — P,(x) = Pr_1(x), 
dx 


where 
P,=(x—a)*/n!. 


Now this set is not uniquely determined by (2). There are in fact 
infinitely many sets of polynomials { P,} that satisfy (2). These bear 
the name of Appell sets, after the man who in 1880 [1]* first made a 
study of them. Appell sets will be the first class to be considered here. 

There are many conditions that are equivalent to the defining rela- 
tion (2) for Appell sets. Among the simplest are the following two: 


1 An address delivered before the meeting of the Society in Washington, D.C., 
May 3, 1941, by invitation of the Program Committee. 

2 By a set {P,} we understand an infinite sequence Po, P:,---, with P, of 
degree n. 

3 Strictly speaking, Appell’s definition is 


d 
(2/) Pitz) = nP,_1(x), 
dx 
but (2) is preferable for our purpose. If {P,} satisfies (2), then {! P,.} satisfies (2’), 


and conversely. A bibliography on Appell polynomials is given in Davis [5]. 
4 Numbers in brackets refer to the bibliography placed at the end. 
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(i) A set {P,} isan Appell set if and only if constants {a,} exist (ao*0) 
such that 


(This is the explicit representation of an Appell set.) (ii) Im order that 
{P.} be an Appell set it is necessary and sufficient that a formal power 
series 


(4) AW) = 


exist so that (formally) 
(5) = P,(x)t*. 
0 


A(t) will be called the generating function for { P,}. 

It is natural to inquire, regarding a set of polynomials, if it has 
properties analogous to those of the classical sets (Legendre, Hermite, 
and so on). For example, does it satisfy a linear differential equation 
of the form 


Loy + Liy’ + Ley” = ry 


where L; is a polynomial of degree not exceeding 7, and \ is a parame- 
ter which takes on a value X, for the mth polynomial of the set? Or 
an equation of the same type but of higher order? Such equations 
contain too few constants to serve for all Appell sets, although we 
shall see that there is a subclass of such sets that satisfy equations 
of this type. If, however, we are willing to permit the order to become 
infinite: 


(6) y¥(x) = dy(x), 

r=0 
where L, is of degree not exceeding 7, then there is ample freedom not 
only for Appell sets but for all sets. That is, (6) is a universal equation 
for all polynomial sets in that given an arbitrary set [F.3, it is possi- 
ble to choose {Z,}, {d,}, and indeed in infinitely many ways, so that 
{P,} satisfies (6) (y=P, for \=),). 

The advantages of a universal or, as we may say, canonical equa- 
tion are evident. But this very generality may sometimes be a defect 
in studying particular sets, unless it is possible to characterize the 
coefficients {L,} corresponding to such sets. Fortunately a charac- 


x” x? 
n! (mn — 1)! 0! 
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terization can be achieved in the case of Appell sets. The simplest 
(but not the only®) canonical equation of form (6) for Appell sets® is 


[11] 
(7) (bo + + + +--+ = mPy, n=0,1,---. 


That is, Lo=0, Li=bo+x, Li=bis, k22; Moreover, if {P,} 
has the generating function A (#), then {b,} is defined by 


(8) bat" = = 


It follows from (7) that {P,} is an Appell set if and only if a se- 
quence {b,} exists so that 


(9) mP,(x) = (bo + x) + +--+ +bn1Po, m=1,2,---. 


This is of interest because it gives the set {P,,} by recurrence. It has 
been used by Webster [17] to give a simple proof that the Hermite 
polynomial set is the only Tchebycheff (orthogonal) set that is at the 
same time an Appell set. Here then is an Appell set that satisfies a 
second order equation of form (6). In general we have the theorem: 
An Appell set {P,}, with generating function A(t), satisfies a finite 
order equation of type (6) if and only if A(t) has the form 


(10) A(t) = &®, Q(t) = polynomial; 


and the minimum order of all such equations for {P,} is the degree’ of 
Q(t) [11]. As examples, the set {(x—a)"/n!} and the Hermite set 
have for A(t) the respective functions e~*', e~®/4; and these sets sat- 
isfy equations of the first and second order respectively. 


2. Relation of Appell sets to functional equations. In 1888 Pincherle 
made a study of the following linear difference equation with con- 
stant coefficients: 


(11) L[y(x)] = ajy(x + = F(a). 

j=1 
His memoir, in a French translation, was reprinted in 1926 in Acta 
Mathematica [10]. In the course of his investigation he pointed out 
the value of Appell sets. Thus for (11), two Appell sets preéminently 
suggest themselves, namely {P,}, {Q,}, defined respectively by 


5 There are infinitely many. 

6 Conversely, if {P,} satisfies (7), then {¢nP,} will be an Appell set for proper 
choice of the constants {cn}. 

7 The one exception is when Q(t) =constant 
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(12) L[x"/n!] = P,(x), L[Q,(x)] = x*/n!. 


For as Pincherle observed, if F(x) has the expansion 
(13) P(x) = 
0 
then a formal solution of (11) is given by 
(14) y(2) = 
0 


and if F(x) has the expansion (1) (for a=0), then 


0 
formally satisfies (11). For the case that F(x) is an entire function of 
finite exponential type,* Pincherle showed that there is a solution of 
(11) of the same kind. He also found a solution when F(x) is analytic 
at infinity. His method of proof however did not involve polynomial 
sets. 
In discussing the equation 


(16) L{y(x)] = y(x + 1) — y(x) = F(x), 


a particular case of (11), A. Hurwitz [7] used the set {Qn}, which 
for (16) is the set of Bernoulli polynomials. Since series in these poly- 
nomials serve to define only a limited class of entire functions, 
whereas he was interested in the case where F(x) is an arbitrary 
entire function, Hurwitz modified the polynomials by adding to them 
linear combinations of properly chosen exponential functions. The 
device used has been termed the method of expanding contours. For 
from the generating relation 


(17) 


= 


for the Bernoulli set, valid for || <2z, follows the formula 


1 ‘zdt 
(18) Q(2) = — f — 


Qnidc — 1)" 


C being a contour around f=0 and lying in | e| <2. If now C is ex- 
panded so as to include a certain number of poles of the integrand, the 


8 A definition will be given later. 


= 
0 
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new contour, call it C,, will define for each n=0, 1,--- a function 


) 1 e'*dt 

which is Q,,(x) augmented by certain exponentials (the residues at the 
poles lying within C,). Hurwitz proved that to every entire function 
F(x) given by (1) (e=0) there is an entire function solution of (16) 
given by (15) with Q, replaced by Q,*. 

Carmichael [4] extended the method and the conclusion to equa- 
tion (11). The method of expanding contours can be shown to apply 
to (11) and (16) even when F(x) is not an entire function. All that is 
required is that F(x) be analytic about some point in a circle of radius 
exceeding a number r, whose (fixed) value is determined by the equa- 
tion and the method. That is, one obtains what may be called a semi- 
local solution rather than a completely local one. 

This restrictive condition on F(x) can be, and is, lifted by empha- 
sizing the set {P,} rather than {Q,} [12], [13]. For equation (11), 
P,,(x) is given by 


1 
(20) P,(x) = [ay(x + 1)" +--+ + a(x + 


Let the points —w;,7=1,---,k, be plotted. For each x let 
= | x + p(x) = max {p1(z), px(x)}. 


As x varies, there is a unique point x* at which p(x) attains its mini- 
mum value p=p*. This means that the circle of radius p* and center 
x* will cover the set {—w,;}, and that no other circle of equal or 
smaller radius will have this property. 

Now consider (20). It is undesirable, from the point of view of con- 
vergence, to consider those values of x for which the maximum p(x) 
is attained by two or more of the p;(x)’s. The set of such points x, 
which we shall term the critical set, is to be excepted. The following 
results can be proved: Jf the series yon 'c,P,(x) converges at x =Xpo (not 
in the critical set), then it converges absolutely in the circular polygon 


(21) p(x) =| < p(%o), j=1,---,k, 


and uniformly in every closed region therein; and a necessary and suffi- 
cient condition that the above series converge for at least one x not in the 
critical set is that lim sup | ca in < /ip*. 

When such a series converges, and thus converges uniformly in 
some region, the function defined by the series is of course analytic. 
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The converse is also true: If and only if a function F(x) is analytic 
at x=x* can it be expressed as a convergent P,,-series. The manner of 
proof is roundabout. Set F(x) =1/(x—a) in (11), a being a parameter. 
The resulting equation 


(22) L[W(x; = 

x—a 
is shown to have two (or three, according to the case) meromorphic 
solutions, from which follow two (or three) P,-expansions for 
1/(x—a@), valid in some neighborhood of x=x*, for a on two (or 
three) arcs that form a closed contour around x*. Application of the 
Cauchy integral formula then shows that F(x), analytic at x*, also 
has a convergent P,-series. It follows, also, that if F(x) is analytic 
about x* there is an analytic solution of (11) in the neighborhood of x*; 
and a translation of the variable x permits the point x* to be replaced 
by any other point of the plane. 

It should be added that Appell expansions are usually not unique; 
that is, that the function zero possesses at least one P,-expansion in 
which the coefficients are not all zero. To see this we turn back to (5). 
If t9+0 is a zero of the generating function A (¢), then 


(23) 0= > to P(x) 


is an expansion of zero. 
We now consider the more general functional equation 


(24) A[y(x)] = (x) = F(x), 


which is a linear differential equation of infinite order, with constant 
coefficients. Equation (11) is subsumed under (24). For if we call 


(25) A(t) = > a,t* 


the generating function for operator A [y]|, then (11) is that case of (24) 
for which 


k 
(26) A(t) = ajeit. 
j=1 
The Appell set {P,.} defined by (12) likewise has as its generating 
function the A (t) of (25). Now the expansion problem will differ with 
the character of A(t). In general, the larger the class of functions that 


0 
k=0 
0 
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we wish to expand in a{ P,}-series, the greater must be the restriction 
on A(¢). A fruitful assumption, and one that we now make, is that 
A(t) is of finite exponential type.* That is, the coefficients of (25) sat- 
isfy the inequality 


(27) lim sup | #!a,|1/" < 0. 
no 


If this superior limit has the value a, then A(¢) is said to be of ex- 
ponential type o (exp. type a). 

From (6) we see that A (t)e* is, in the variable t, of exp. type not 
exceeding ¢ + |x|, so that lim sup | m!P,(x)|/"<o+ |x|. This is, how- 
ever, too crude an estimate. To better it, we observe that if a function 
C(t) =)0 cat” is of exp. type o, then it is the associated entire function 
(AEF) relative to the function Ci(t) =) !c,t*; and this latter func- 
tion has 1/o as its radius of convergence. Conversely, if by studying 
Ci(t) one can locate the singularity nearest the origin, that will serve 
to determine the type of C(#). 

Applying this principle to (5), it is found [14] on setting 


(28) A(t) = AEF{A,()}, -A*(#) = #A.(2), 
that 

1 n!P,(x) 


|t| sufficiently large. Let G= {a} be the set of all the singularities of 
A*(t) in the whole plane, and define the continuous function D(x) by 


(30) D(x) = max |x+1/a|. 


It can then be established that 
(31) lim sup | #!P,(x) = D(x). 


Let us call the curves D(x) =c the level curves for the set {P,}. A 
point set discussion reveals the following information: There is a 
unique point x* where D(x) has its minimum value D,,. For every 
c>Dn, the level curve is a simple closed convex curve, containing in its 
interior the level curves D(x)=c’ for every c’ in D,,Sc’<c. In the 
case that the points {a} are finite in number, the level curves consist 
of circular arcs. This is the situation relative to equation (11), where 


® See Muggli [9], where other cases are also taken up. The solution of (24) is there 
carried out by the method of expanding contours, and for the case where A(t) is of 
finite exponential type, F(x) is assumed to be an entire function. 
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the region of convergence of a P,,-expansion was found to be a circu- 
lar polygon. 

With respect to the present general case we find that: Jf lim sup | Cn | is 
=p<1/Dyn, then series a(x) converges absolutely interior to the 
level curve D(x)=1/p, and converges uniformly in every closed region 
therein, thus representing an analytic function there. 

At this point we are forced to abandon the method that proved so 
useful in dealing with (11), for we have not sufficient knowledge of 
solutions of the equation corresponding to (22) to follow the earlier 
plan. Turning to the polynomial set {Qn} of (12), and modifying it 
by the method of expanding contours, it is possible, first to solve 
equation (24) semi-locally, and then to obtain a P,-expansion theo- 
rem: If F(x) is analytic about a point x =x 9 in a circle of radius exceed- 
ing \, where d is determined by the operator A, then equation (24) has 
an analytic solution y(x) in some neighborhood of xo; and if F(x) is 
analytic about x =0 in a circle of radius exceeding d* (again determined 
by operator A), then F(x) has a convergent Appell expansion 


(32) F(x) = n'!CnP,(x). 


The numbers X, \* are not sharply defined. Burdette [3] has obtained 
sharp values on making the additional hypothesis that A(t) is 
bounded on a certain infinite sequence of expanding contours. 


3. Sets of type zero. Let us turn back to equation (16). Relative to 
it one can consider the set of Newton polynomials 


x 
(33) N,(x) = 


n! 


for this set, which is not an Appell set, has the fundamental property 
of reproducing itself under application of the difference operator: 


(34) AN, (x) = N-1(2), 


so that if F(x) has the expansion 

(35) F(x) = >> c,N,(2), 
0 

then a formal solution of (16) is given by 
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We may say that {N. a! is a set of difference polynomials, as is every 
other set (and there are infinitely many such) that has the property 
(34). Such sets may be examined as were Appell sets, but at this 
point there is suggested a general problem: Can there be assigned to 
an arbitrary set {P,} an operator J that will reproduce {P,} in the 
sense that 


(37) JI ([P.(x)] = Prs(x) 


for all 
The answer is in the affirmative, and J can be expressed in the form 


(38) Jly] = Ln -1(x)y™ (x) 


where L;(x) is a polynomial of degree not exceeding k. This too is a 
universal (or canonical) operator, since to every set corresponds a de- 
termined J. For Appell sets and for difference sets, the L,’s are con- 
stants. This suggests examination of all sets with constant L,’s. We 
shall say that a set {P.} is of zero type [15] if in its associated operator 
J (as given by (38)) the polynomials L,(x) are all constants. 

For zero type sets, then, (38) can be written 


(39) J[y(x)] = ™(x), 0. 


n=1 


Let the formal power series 
(40) = et" 
1 


be called the generating function for (39), and let H(t) be the inverse 
power series to J(t), so that (formally) 


(41) J(H(t)) = 
Zero type sets can then be characterized as follows: {P,} is a set of 
type zero if and only if a formal power series A (t) =)'s ant” exists so that 


(42) = P,(x)t". 


Observe that the Laguerre polynomials { L(x) } are of zero type, since 
xt 


1 


(43) 


| 
0 
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Various other characterizations of zero type sets can be found. One 


of the simplest is that constants {quo}, {qu} exist so that the recurrence 
relation 


(44) DX (quo + = nP,(x) 

k=1 
holds forn=1,2, - - - . The relation between (42) and (44) is expressed 
by the series 


oo 


= A(t) = 
0 


0 


A'(t) 


(45) a 


Having learned that among Appell sets is a Tchebycheff set (the 
Hermite polynomials) one naturally asks if there are other orthogonal 
sets in the extended class of zero type sets. This problem was solved 
by Meixner [8], by use of the Laplace transformation. He took (42) 
as his definition of the polynomials that he was investigating. Follow- 
ing Meixner, Geronimus obtained some interesting properties of zero 
type sets. A second resolution of the Meixner problem can be made by 
the combined used of (44) and the characterizing recurrence relation 


(46) Q,(x) = (x + An)On—1(x) MnQn—2(x), 


for orthogonal polynomials. For (46) to represent a set of type zero 
it is necessary and sufficient that 


(47) An = a+ bn, Hn = (m — 1)(c + dn), 


with c+dn+0 for n>1. From this will follow the explicit determina- 
tion of the required sets, as (otherwise) obtained by Meixner. 

Consider an operator J of form (39). The equation (24) can be 
equally well expressed in terms of J: 


(48) A[y(x)] = atJ*[y(x)] = F(x), 


where J*[y]=J[J*“[y]], J°[y]=y. As such, we see that polynomial 
sets of zero type suggest themselves as a means of studying equation 
(24). This study, however, awaits more knowledge of the expansion 
properties of such sets. 


4. On equations with non-constant coefficients. The functional 
equations already considered have been of the constant coefficient 
type, and it may be asked if zero type sets can be applied to more 
general equations. Such application has been made to the linear differ- 


7 = 
k=0 
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ential equation of finite order, and this points the way to other equa- 
tions (where, however, the results are as yet only formal). 
Consider the differential equation 


(49) L[y(x)] = y + + + pe(x)y = F(x). 


We seek [16] an operator M[y] that is inverse to L[y] in accordance 
with the relations 


(50) LM[y] =. 
The assumed form of M[y] is 


+00 

(51) M[y] = M,(x)y™(x), 

where 

(52) = fo fo 
70 n-fold) 


It is found that (50) can be satisfied by choosing M,(x) =0, n> —k; 
M_;(x) =1; and the other M’s by recurrence: 
M_m(x) = + + 
+ m > k. 


(The functions C;;(x)are determined by the coefficients p,(x) and their 
derivatives.) On setting 


(53) 


(x — 


(54) H (x,t) = M_.(x) + M_ce+1)(x) +---, 
a solution of (49) is given by 
(55) y(x) = f H(x, #)F(t)dt. 
Zo 
H(x, t) itself satisfies (in the variable x) the homogeneous equation 
(56) L{u] = 0, 
as do also 0H /dt, 07H /dt?, --- , 0*-1H/dt*—!; and these k functions 


form an independent set of solutions. 

The series (54) has an interesting domain of convergence, but we 
must forego a discussion of this [16]. What is of importance for our 
present aim is the observation that the Appell polynomial set 
{(x—t)"/n!} is present (at least, beginning with »=k—1) in this 
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series. Any cther Appell set would (formally at least) lead to the same 
result, in the following sense: If { P(x) } is an Appell, set then the 
formal series 


(57) K(x, t) = t) = t) 


satisfies (56). The convergence problem for (57) has not been ex- 
amined, although relation (31) is certainly applicable. 

With (49) as a guide, we turn to the corresponding difference equa- 
tion 


(58) = A*u(x) + qi(x)A* u(x) + - + qu(x)u(x) = F(x). 


Here we look for an inverse operator M(LM[u]=ML[u]=ux) in the 
form 


+00 
(59) M[u] = M,(x)A*u(z), 
where for negative m the “sum” is meant; that is, A*{[A-*[u]]=u. 
We can again choose M,(x)=0 for n»>—k; M_;(x) =1; after which 
M_ ces (x), M_ce42)(x), - are determined by recurrence, in analogy 
with (53). And like (54) (or (57)) we set up the (formal) series 


(60) K(x, t) = M_:(x)Pia(x — t) + +--- 


where {P,} is any difference set (AP, =P,_1). 

Convergence has not yet been examined. Formally, however, (60) 
satisfies in the variable x the homogeneous equation L[u]=0, as do 
the functions AK, A?K, - - - , where the differences are taken with re- 
spect to the variable ¢. 


5. Remarks on null-functions. We shall close with some remarks 
on the problem of null-functions (as we may call them). Let an infinite 
sequence of functionals {2(,} be given. We assume them to be of the 
form 


(61) mM, [ f(x) Cnof (0) + Cnif (0) Cnof +--- 0, 1, 


supposed applicable to a given class of analytic functions. By a null- 
function we shall understand a function f(x), not identically zero, for 
which 


The determination of null-functions is linked to the problem of find- 
ing the radius of completeness of the sequence of functions { M,(t) :, 
where 


> 
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(63) = Cno + Cnit + Cnol? + ag 
that is, the largest number 7 such that every function f(t), analytic 
in |t| <r, has an expansion 


(64) = 


convergent in | ¢| <r and uniformly convergent in every closed region 
therein.?° 
An interesting subclass of (61) is that in which 2, has the form 


(65) = (0) + Can #0. 
An example of this, namely when 
(66) = f(a), | <1, 0, 1,---, 


leads to the Takenaka problem [18] of determining the largest num- 
ber \ for which it is true that no entire function of exp. type less than 
\ can be a null-function. It is known that A\2log 2, and it is believed 
that \=7/4. The example 


TX 


= 


shows that 2/4 cannot be exceeded. 

Corresponding to the above sequence a, = (—1)” (and it may very 
well be true for every sequence for which {a,} is bounded), the fol- 
lowing is true: Given any o>0, there is only a finite number of 
linearly independent null-functions of exp. type not exceeding o. (As 
o—«, however, the number of independent null-functions is un- 
bounded.) If, however, the sequence {a,} is not required to be 
bounded, and in other cases of the functionals {2,,}, there may be a 
continuous array of null-functions. This fact we shall illustrate by 
borrowing from the theory of sets of type zero. Consider, in fact, the 
set {P,,} for which 


(67) = Ppt; 
0 
or, letting J(t) be the inverse of H(t), 


(68) et = 


10 For work in this direction, consult Whittaker [18], Gontcharoff [6] and Boas 
[2]. These works include further references. 


0 
f(x) 
x) = cos — — sin—-» 
4 4 
0 
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Take [J(t)]* as the function M,(#) of (63), which thus determines the 
functionals A, [f]. 

A simple example is given by H(t) =t=J(t), for then [f] =f (0), 
so that there is no null-function that is analytic at the origin. Now 
take H(t)=log (t+1), J(#)=e'—1. One verifies that the function 
f(x) =e*(1 —e?***) is a null-function for every integer k and for every 
value of a. That is, there is a continuous spectrum of null-functions. 
This property is shared by functionals {A,} corresponding to other 
choices of H(#). 
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PENNSYLVANIA STATE COLLEGE 


ON TRIGONOMETRICAL SERIES WHOSE COEFFICIENTS 
DO NOT TEND TO ZERO! 


R. SALEM 
Let >\p, cos (nx—an) be a series (S) such that 
(1) pr 20, limsupp, > 0. 


It is well known that the set E of convergence of S is of measure zero, 
a result due to Cantor and Lebesgue. More recently Rajchman has 
proved that E is a sum of an enumerable sequence of H-sets, and so in 
particular (an H-set being closed and of measure zero) that E is of 
the first category.” 

We propose to establish a more precise property of the sets E and 
to show the connection between these sets and sets of absolute con- 
vergence (that is, sets in which a trigonometrical series can converge 
absolutely without being absolutely convergent everywhere). 

We propose to call, in memory of Rajchman, “set of the type R” 
any set E such that a series (S) exists which satisfies the condition (1) 
and converges in E. E being a sum of closed sets it is natural to in- 
vestigate the properties of perfect sets of the type R. Let P be such a 
set. For every x belonging to P we must have lim p, cos (nx—a,) =0. 
But by (1) there exists an infinite sequence of integers {n,} such that 
Pn, Hence, 


(2) lim cos (mx — am,) = 0. 


We can assume, without loss of generality that the point x =0 belongs 
to P. Hence 


(3) lim cos an, = 0. 


(2) and (3) give immediately 


(4) lim | sin mx sin an,| = 0, 
and as by (3) 

(5) lim | sin mx cos an,| = 0 
we get, by adding (4) and (5), 

(6) lim | sin mx| = 0. 


1 Presented to the Society, May 2, 1941. 
2 See Zygmund, Trigonometrical Series, Warsaw, 1935, pp. 267-270. 
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This implies, for every x belonging to P, lim sin? n.x=0. Let F(x) be 
any non-decreasing bounded function, constant in each interval con- 
tiguous to P, but not everywhere. The sequence { sin? nix} being uni- 
formly bounded, we have 


lim f sin? ny»x dF = 0 
0 


or 
2r 
(7) lim f cos 2n,x* dF = F(2m) — F(0), 
0 


that is to say, not only the Fourier-Stieltjes cosine coefficients of rank 
2n, of dF do not tend to zero (even if F is continuous), but they tend 
to the greatest possible limit as k— ~. It is interesting to observe that 
the sequence {n,} is independent of the function F. 

It has been proved that if a perfect set P is a set of absolute con- 
vergence, then for every function F of the above-described type 


2r 
lim sup f cos 2nx dF = F(2x) — F(0). 
0 


Thus it appears that sets of the type R and sets of absolute conver- 
gence are closely connected. 

The necessary condition (7) shows that the condition that a set of 
the type R should be a sum of H-sets is too lax. In particular, even 
a single H-set is not, in general, of the type R, as can be seen by the 
example of Cantor’s ternary set C. This is an easy consequence of the 
necessary condition (6) and of the well known fact that the set of all 
possible sums x+y where x and y belong to C fills the whole interval 
on which C is constructed.* 

Moreover, the sum of two perfect sets of the type R may not be 
of the type R. This result which has been proved by Marcinkiewicz5 
for perfect sets of absolute convergence holds good for sets of the 
type R and is a consequence of Marcinkiewicz’s argument and of the 
necessary condition (6). 

Let now P be a perfect set such that the condition (7) is satisfied. 
We can find a function F, bounded, constant in every interval con- 


3 See Salem, The absolute convergence of trigonometrical series, Duke Mathematical 
Journal, vol. 8 (1941), pp. 317-334. 

4 See Zygmund, loc. cit., p. 133. 

5 Travaux de la Société des Sciences et des Lettres de Wilno, Classe des Sciences 
Mathématiques et Naturelles, vol. 12 (1937). 
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tiguous to P, but increasing from one interval to another, and a se- 
quence {m;,} such that /, » sin?’m,x dF <1/k*. Hence 


2r 
sin? dF < ©, 
1 0 


Hence “almost everywhere” in P, that is to say, in a subset P; of P 
such that the variation of F over P—P, is zero, >-f k? sin? mx< ©. 
But 


1/2 


k k 1 \1/2 k 
sin mz| <( sint mz) 
1 1 k? 1 


hence >| sin myx| converges in Pi, and P is “almost everywhere” of 
the type R (and also, almost everywhere, a set of absolute conver- 
gence). 


MONTREAL, CANADA 


UNITARY SPACES WITH CORRESPONDING 
GEODESICS! 


N. COBURN 


1. Introduction. This paper is divided into three parts. In the first 
section, the notation and fundamental concepts of hermitian geome- 
try are reviewed. The second section develops the equations of geo- 
desic curves X; which depend on a real parameter (t) and which are 
imbedded in a unitary space of m-dimensions K,. Our principal result 
is: The equations of such geodesics differ from the equations of geo- 
desics in Riemannian space in that the former contain the torsion 
affinor. In the third section, we classify the connections of two unitary 
spaces K,, ‘K, whose geodesics correspond. First, we find the neces- 
sary and sufficient conditions that two unitary spaces K,, 'K, both 
with symmetric connection shall have their geodesics in correspond- 
ence. This last problem is solved in exactly the same manner as the 
similar problem in Riemannian space.” Secondly, we prove that if K, 
has torsion and ’K, has no torsion (symmetric connection), then their 
geodesics do not correspond. The problem of determining all connec- 
tions of unitary spaces K,, 'K, both with torsion whose geodesics 
correspond is left open. 


1 Presented to the Society, January 1, 1941. 
2 L. P. Eisenhart, Riemannian Geometry, Princeton University Press, 1926, p. 131. 
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2. Notation.’ Consider a real space of 2m-dimensions X2, whose co- 
ordinates are given by the real variables 


Into this X2,, we introduce the complex coordinates given by 

P= 2+ iy, 
(2.2) i 

2 — iy, i= (— 1)", 


Since the Jacobian of this transformation (— 27) does not vanish over 
Xn, then ®, &” constitute a set of 2n independent variables which 
map the X2,. In view of the fact that ” are complex conjugates to *, 
we determine the points of X2, by merely assigning complex numbers 
to &. We say that the & determine “points” which build a complex 
space of m-dimensions X, (the above real topological X>,). Let us de- 
note partial derivatives by 


(2.3) On = = 


and let ¢(£, #”) be an analytic function of the variables #, ®". Then, 
by the composite function theorem, we obtain 


(2.4) = + 

(2.5) 3¢/dy" = id,b — 
Solving for we find 

(2.6) 0,6 = 1/2(0¢/dx* — id¢/d4*), 
(2.7) = + 
If the function ¥(®), 

(2.8) ¥(&) = + Y), 


is analytic in the sense of Cauchy-Riemann, then 
du/dx* — dv/dy = 0, 
(2.9) / /ay 
0v/dx + du/dy = 0. 


By expanding the right-hand side of (2.7), we find that (2.9) is equiva- 
lent to 


(2.10) any = 0. 


From our point of view, this equation follows directly from the fact 


3 J. A. Schouten and D. J. Struik, Einfuhrung in die Neueren Methoden der Differen- 
tialgeometrie, P. Noordhoff, Groningen, Batavia, vol. 2, 1938, p. 225. 
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that #, ®” are independent variables. Hence equation (2.9) merely 
serves to interpret the equation (2.10) from the point of view of com- 
plex variable. If ¥* is the complex conjugate function to y, 


(2.11) y* = u — iv, 


then it is easily shown that (2.9) is equivalent to the so-called con- 
jugate equation 


(2.12) ay* = 0. 


We seek to generalize the idea involved in (2.12). Corresponding to 
any function ¢(@, let o*(#", denote the function obtained 
by replacing 1 by —7. From (2.2), we see that the variables # are 
then replaced by £" and conversely. We call this function ¢*, the con- 
jugate of ¢. In the future, we shall indicate the validity of the conju- 
gate by the abbreviation “conj.” It is to be noted on the formal side, 
that in passing to the conjugate, all indices will be starred. The star 
of a starred quantity removes the original star. 

Consider the allowable analytic coordinate transformations with 
nonvanishing Jacobian 


(2.13) P=P2'), = + iy”. 
The corresponding conjugate equations are 
(2.14) = — iy’, 


We now introduce the unit affinor whose intermediary components 
are 


(2.15) Aw, = fat, Age = OF 


A vector »(#, ®") is said to be of the first type if 
(2.16) v =A)?. 
Associated with each such vector is its conjugate, or vector of the 
second type v*, with transformation law 
x* 

(2.17) 
The theory can be extended to affinors of any mixed valence. 

Let us introduce a connection in X,, by means of the ? quantities 
ry which are functions of position. Then, we define the covariant 
derivative of a contravariant vector by 


(2.18) = dv + de’, conj. 
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The Fr transform in a well known manner‘ under (2.13). We now 
write for a covariant vector 

(2.19) dw, = dw, — , conj. 

By expanding the ordinary differential of a vector, we obtain 

(2.20) = + conj. 


If we define the covariant derivative of , w by means of 


(2.21) = 0,0 + | Vur = OuWr 


(2.22) = VutWr = CODj., 
then 

(2.23) = + conj., 
(2.24) bw, = + Cconj., 


An hermitian X,, with covariant derivative defined by (2.21), (2.22) 
is denoted by K,. 
We now introduce an hermitian tensor with hermitian symmetry 


the (’) indicating the transpose matrix. If we condition a,+ by requir- 
ing that 


(2.26) 6ay,° = 0 => — + dE 


then the space K, is said to be a ahitary K,. For such a space, from 
(2.26), we can prove® 


(2.28) = — = 0, conj. 


The a,+ is now a fundamental tensor and can be used to raise and 
lower indices through the V operator. If we define the contravariant 
fundamental tensor a”* by 


(2.29) a’ = Aj, conj., 

then (2.27), (2.28) may be solved for the connection 
(2.31) Tene = = 


4 J. A. Schouten and D. J. Struik, loc. cit., p. 227. 
5 J. A. Schouten and D. J. Struik, loc. cit., p. 234. 
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Finally, we introduce the torsion affinor 


(2.32) = — = Thay, conj. 


The sign [ ] means that the antisymmetric product of the enclosed 
indices is to be formed; the sign ( ) means that the symmetric product 
of the enclosed indices is to be formed. 


3. Equations of geodesics in K,. Consider the curve X, defined in 
K,, by means of the equations 


(3.1) P= PH), = 


where (#) is a real parameter and #, #” are conjugate functions. Evi- 
dently, such equations determine a curve X, in the real topological 
Xen. We define the element of arc length along this X; by 


(3.2) ds = 


By substituting (3.1) into (3.2) and integrating, we obtain (s) the arc 
length parameter. It is to be noted that this parameter exists only 
because of the fact that (#) is real. By use of (2.25) and the conjugate 
of (3.2), we see that (ds) and hence (s) are real. 

We follow the well known methods of the calculus of variations® in 
finding the geodesics of K,. Our problem is to find that X, for which 
the first variation of 


to 


is zero. Let # represent the coordinates of a point on the geodesic Xi; 
let ‘® represent the coordinates of a point on the varied ’X,; let € be 
an infinitesimal; and let w be an arbitrary function of ®, ®” which 
vanishes at t=to, 41. The equations of variation can be written as 


(3.4) = + ew, conj. 

For clarity, we write 

where 


Then by use of Taylor’s series and integration by parts, we obtain for 
the first variation of (3.3) 


6 L. P. Eisenhart, loc. cit., p. 49. 
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d | Op d ( dg )| 


First, let us assume that the vector (w*, w”) has the components 
(a, b,---,a,6b,--+) where a, b,--- are arbitrary real functions of 
Secondly, let us assume that (w*, w”) has the components 
(ia, ib,---, —ia, —ib,---). Then, by substituting into (3.7), we 
find that this relation decomposes into the two equations 


3.8 —| — dt, 
Lae dt \ae 


By the ordinary argument, we obtain the Euler equations 


(3.10) 
ae \ ab 
3.11 =0 
oe dt (=) 


We follow Eisenhart in listing the steps of our computation. From 
(3.5), we have 


ao 1 1 
(3.12) 
of 2 (ds/dt) 
d ( do ) 
dt \ 
(3.14) 


2 (ds/dt)? 


Substituting these relations into (3.5), we obtain 


as 
(3.15) 


(ds/dt) 
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From (2.30) and (2.31), we find 

(3.16) = 
(3.17) = 
Hence (3.15) becomes 


3.18 
(3.18) 


Transvecting (3.18) with a**, we obtain after combining terms two 
and four 


a* v* ar a* d’s ds 
From the equation (3.11), we obtain the conjugate of (3.19). Hence, 
we have this theorem: 


THEOREM 1. The geodesic X, in unitary K, which are functions of a 
real parameter (t) is given by the solutions of the second order differ- 
ential equation (3.19) and its conjugate. 


Evidently, the equations of the geodesic X, in unitary K, differ 
from the equations of the geodesic V; in V, in the additional term 
containing the torsion affinor. It follows that if the unitary space K, 
has no torsion, then the geodesic X, of K,, satisfies the same type of 
differential equation as the geodesic Vi in V, and conversely. 


4. Unitary spaces with corresponding geodesics. By multiplying 
the conjugate of (3.19) by d,&*, forming a similar equation with the 
indices a, 8 interchanged and then subtracting the two resulting equa- 
tions, we obtain’ for the geodesic X, in K,, 


(ad dt” — + — Tedd 


+ dd dd dd dd’ = 0, conj. 


Consider another n-dimensional unitary space ’K, which is also 
mapped by the variables & such that the point P(&) of K, corre- 
sponds to the point ’P() of 'K,. Furthermore, let the fundamental 
tensor and connection of this space be denoted by 


(4 2) "Oru, 
7L. P. Eisenhart, loc. cit., p. 131. 
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If the geodesics of 'K, correspond to those of K,, then equation (4.1) 
with a, TX,, replaced by ‘ays, ‘TY,, is satisfied by the 
same functions £(¢) which satisfy (4.1), that is, 
~ + (Tad) — dt 
4.3 *a a 
(4-3) +2 aye — = 0, conj. 


Let us first assume that K, and ’K, both possess symmetric con- 
nections. 


THEOREM 2. The necessary and sufficient condition that two unitary 
spaces K,, 'K,, both with symmetric connections have geodesics in corre- 
spondence ts that a vector p, exist such that 


Ty, = + conj. 
We can write the equation 


(4.4) T, =T,+A, , conj., 


where A;;* is an affinor to be determined. Subtracting (4.1) from 
(4.3) (note: the torsion affinors are zero in this case), we obtain 


(4.5) (A, dé dt = 0, conj. 
By use of the unit affinor Af, we may rewrite (4.5) in the simpler form 
(4.6) (A, A, — As dit’ = 0, conj. 


Since (4.6) is to be satisfied by arbitrary values of d,é* at any point 
P(&) and ’P(#), we find 

(4.7) A (ms Ay — Ay) = 0, conj. 
Expanding (4.7), we obtain 

B B B 

(4 8) A Ay + Ar + A oy) Ap 

= A (yy Ay + + Conj. 
Let us write 
(4.9) Aja = (n+ 1)py, conj., 


and contract on the indices yu, a in (4.8). Then we obtain 


(4.10) = 2p conj. 
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Conversely, if (4.10) is satisfied, then (4.6) is identically satisfied. 
Hence our theorem is proved. 

Let us now assume that K, has torsion and ‘K, has no torsion 
(symmetric connection). 


THEOREM 3. Two unitary spaces, K, with torsion, 'K, without tor- 
ston, cannot have their geodesics in correspondence. 


Let us assume the contrary, namely, that the geodesics of these 
two spaces correspond. Then again, we write the equation (4.4) and 
again, we subtract (4.1) from (4.3). In this case, instead of equation 
(4.6), we obtain 

Ay Ay de dE 


+ (0 "A, — = 0, conj. 


Equation (4.11) is to be satisfied by arbitrary values of the d,&. 
Let us first assume that the vector (dé, d,é") has the arbitrary 


real components (a, b, c,---a, b, c,---+) and secondly, let us as- 
sume that this vector has the arbitrary pure imaginary components 
(ta, 1b, ic,--- —ta, —ib,---). By adding the resulting equations, 


we find that (4.11) decomposes into the two equations 


(4.12) (An A, ‘cami: 


(4.13) —@ dif dé = 0, conj. 


The equation (4.12) can be analyzed in an exactly analogous manner 
to that used in the previous theorem excepting that (4.9) must be 
replaced by 


(4.14) Ave = (n + 1)p,, Ae = (n + 1)q,, conj., 


since A,;* is no longer symmetric in v, uw. Instead of (4.10), we find 


(4.15) = 2r, = + Conj. 

In order to analyze (4.13), we write this equation in the form 
(4.16) An = 0, conj., 

where 

(4.17) = (a a conj. 


As previously, let us assume first that (d,£, d.é*") has the arbitrary 
real components (a, b, - - - a, b, - - -). Upon substituting into (4.16), 
we obtain a third degree homogeneous polynomial. We shall explicitly 
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write those terms which contain a, b; the remaining terms are formu- 
lated in a similar fashion. We have 


aB as aB ap 
A + a b(A + + 
2 aB as as aB 
+ ab (A + + A 221°) + +--- =0. 


Secondly, let us assume (d,£, d,¢*") has the arbitrary components 
(a, ib, - --a,ib,---). Then (4.16) becomes 


(4.18) 


+ a bi(— + + 
+ + Ans — Aan) — = 0. 
From these equations, that is, (4.18), (4.19) we conclude 
(4.20) Alte = Ase = Ate = = 0, = = 0. 
We can evidently express (4.20) by writing 
(4.21) Aone = 0, coni. 
By use of (4.17), we find upon expanding 


Placing a equal to uw and contracting, we find 


Solving (4.23) for S;sxs, we find 

(4.24) = SpAss, conj., 
where 

(4.25) = nS,*, CON]. 


Since S;%. is antisymmetric in v*, u* we have 
(4.26) = 0, conj. 
Contracting on a*, u* we obtain the relation 
(4.27) S,- = 0, conj. 


Hence 5S, vanishes. But this is contrary to our assumption. Hence 
our theorem is proved. 
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SUR LES SOLUTIONS DES EQUATIONS DU TYPE 
PARABOLIQUE DETERMINEES DANS UNE 
REGION ILLIMITEE 


MIROSLAW KRZYZANSKI 


L’un des problémes rélatifs 4 l’équation de la chaleur consiste en la 
recherche d’une solution de cette équation, se réduisant 4 une fonc- 
tion donnée sur la caractéristique. Cette solution est donnée par I’in- 
tégrale classique de Weierstrass.! Cependant la question de I’unicité 
de cette solution n’a été complétement résolue que dans les travaux 
récentes de M. M. Picone et M. A. Tychonoff.? 

Or il ya intérét 4 éteindre ces résultats 4 l’équation linéaire du type 
paraboiique: 


m m Ou 
Au(xs, X2,°°* 5 Xm, y) = a;(x1, » Xm, y) 
i, k=l j=l Ox; 
(1) 
Ou 
+ b(x1, X2,°°* Xm, y) ay + X2,° °° » Xm; y)u, 
y 


la forme >-7,~1A4 AA; étant positive définie, en cherchant la solution u 
de cette équation déterminée dans une région illimitée R: OS y<h; 
—o<x;<+o (¢=1,2,---, m), telle que 


Nous commengons par introduire une classification des fonctions 
caractérisant la facon dont elles se comportent 4 |’infini. Convien- 
drons d’appeler dans la suite la classe E, (a >0) une classe de fonctions 
F(x) qui ont la propriété suivante: il existe deux nombres constants 
positifs M et K tels que 


| F(x) | < 


Nous allons voir que dans la classe EZ, par rapport aux x; il existe 
une solution et une seule du probléme posé. 
Le résultat analogue pour |’équation de la chaleur a été obtenu par 


1 Voir par exemple E. Goursat, Cours d’ Analyse, Tome 3, Paris, 1927, chap. 29, 
p. 294. 

2 P. Picone, Sul problema della propagazione del calore in un mezzo privo di frontiera, 
Mathematische Annalen, Tome 101 (1929), pp. 701-712. A. Tychonoff, Théorémes 
d'unicité pour l’équation de chaleur, Recueil Mathématique de Moscou, Tome 42 
(1935), pp. 199-214. Cf. aussi le travail récent de M. F. G. Dressel, The fundamental 
solution of the parabolic equation, Duke Mathematical Journal, Tome 7 (1940), pp. 
186-203. 
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M. Tychonoff (voir le travail cité), qui a démontré en outre que 
l'unicité du probléme n’a pas lieu dans la classe E2;,, ot € est un 
nombre positif arbitrairement petit. 

Considérons l’équation de la forme (1) dont les coefficients sont 
continus et bornés dans la région R; plus précisemment, supposons 
l'existence des nombres positifs A,a et y tels que: | Aiu| <A, |a;| <a, 
<y, (i, k, 7=1, 2, - - - , m), dans toute la région R; quant au coeffi- 
cient b, on suppose I’existence des nombres et tels que 61 >b >B >0 
partout dans R. Tout ceci étant supposé, nous allons démontrer le 
théoréme suivant: 


THEOREME I. La seule solution de I’ équation (1) appartenant 4 la 
classe E2 par rapport aux x; et s’annulant sur la caractéristique y=0 
est u=0. 


D£moNSTRATION. D’aprés la régle générale due 4 M. Picone (voir le 
travail cité) on transforme I’équation (1) en posant u(x1, , Xm, 
=0(x1, X2, Xm, W(x1, Xm, v étant la nouvelle fonc- 
tion inconnue; W étant supposé différent de zéro, on obtient ainsi 


ead 070 q) ov (1) 
(2) Ak — —+b—+e 
jut Ox; dy 


et on doit choisir W de sorte que 1° le coefficient c soit positif dans 
R, 2° 7-0 pour) 7, 

Passons donc 4 la construction de la fonction W conforme 4 notre 
probléme. La fonction u appartenant 4 la classe Ee, il existe deux 
nombres M et Ko tels que 


(3) | u(x1, X2,° Xm, y) | < M exp (Ke 


i=1 


dans R. Soit 


5K (1+ | 
= 
Y 


ou K > Ky et posons 


u(x1, X2,°°*, Xm, ¥) = Xm, x1, , Xm, ¥; K), 
avec 
K Dinix? 
(4) tm 95K) = exp] + vy]. 


Supposons que la hauteur h de R est inférieure 4 1/y. 
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Ainsi dans la transformation de M. Picone nous avons remplacé la 
fonction W par H. Un calcul montre qu’en chaque point de R le coeffi- 
cient c‘ de l’équation (2) est supérieur 4 une forme quadratique en x; 
positive définie, 4 savoir: 


Il en résulte que si l’on construit un parallélipipéde rectangulaire R,, 
détaché de R par les plans x;= +n, la fonction v ne pourrait atteindre 
dans R, ni un maximum positif, ni un minimum négatif que sur la 
partie de sa surface située sur les plans x;= +n et y=0 (voir le travail 
cité de M. Picone). 

Soit Po(x, x, ---, y) un point quelconque de R; pour 
assez grand R, contiendra Po. En vertu de (3) on peut déterminer n 
de sorte que |»| <e sur la surface latérale de Rn, €>0 étant un nombre 
arbitrairement petit. D’autre part v s’annule sur le plan y=0, donc 
|o(x, x0, ---, x, y)| <e. Il enrésulte que v(x, x, -- -, x, 

y) = Oet, par u(x, x, ---, x, y)=0. Comme Py est un 
point arbitraire de R, ona 


u(x1, X%2,°°** 5 Xm, y) = 0 


dans R. 

Passons 4 la démonstration de l’existence de la solution du prob- 
léme, en supposant que les coefficients Ai. et b admettent les dérivées du 
premier ordre continues. 


THEOREME II. La fonction $(x1, x2,°*+, Xm) étant de la classe Es 
et continue, il existe une solution et une seule de Il’ équation (1) apparte- 
nant a la classe E2 en tant que la fonction des x; déterminée dans la 
région R de hauteur h (suffisamment petite) et se réduisant a la fonction 
¢ sur la caractéristique y=0. 


DEMONSTRATION. L’unicité de cette solution résulte aussitét du 
Théoréme I. I] suffit donc de démontrer son existence. 

La fonction @ étant de la classe E2, il existe deux nombres M et K 
tels que 


(5) | o(x1, | < M exp (xx 2). 
i=1 
Construisons la parallélipipéde R,, analogue 4 R, (voir la démon- 
stration du Théoréme I) et soit S, sa surface latérale, composé des 
parties des plans x;= +p. D’aprés les résultats de M. Giraud et 
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M. Gevrey? il existe une solution u, de (1) teile que: 

Cette solution est déterminée dans R,. D’une maniére analogue 
définissons la solution u, de (1), déterminée dans le parallélipipéde 
R, (q>p) analogue 4 R,; soit S, sa surface latérale. 


Supposons que h<2/KA et soient vp, v, et v* les fonctions définies 
par les égalités 


up = vp Xm 2K); 04° H (x, Xm, 2K); 
Ug = (x1, Xm, ¥; K). 
Observons que H(x1, x2, Xm, ¥; 2K) >H(x1, x2, +, Xm, K) 


-exp (K >_™,x?) (K étant un nombre positif) de sorte que 
(6) < exp (- K> 
i=1 


on peut supposer que K<K. D’aprés (5) et la définition de H on 
a <M sur S, et (x1, x2, +, 0)| <M; il en résulte que 
lve | <M partout dans R, et en particulier sur S,. En égard 4 (6) on 
a donc |v,| sur S,. 

La fonction: 


W pq(X1, X%2,°°* Xm; y) X%2,°°* y) 7 °° Xm, y) 


est déterminée partout dans R, et l’on a | wy,| <2M exp (—_Kmp?) sur 
Sp et X2, Xm, 0) =0. Donc 


(7) | 2, °° Xm) | < 2MeKms" partout dans R,. 


Soit maintenant Py un point quelconque de R; considérons un 
parallélipipéde fixe pCR de hauteur h, entourant Py. D’aprés (7) on 
peut déterminer un nombre fp de sorte que |’on ait 


| X%2,° °° » Xm; y) u,(x1, X2,°°* » Xm, y) | <e 


pour g>p>Po partout dans p. Ceci montre que la suite {u,} tend 
vers une limite U(x1, x2, - - - , Xm, y) partout dans R, le point Py étant 
un point arbitraire de R; en particulier U,—U uniformement dans p. 

Nous démontrerons que U est précisemment la solution de notre 


3G. Giraud, Sur certaines opérations aux dérivées partielles du type parabolique, 
Comptes Rendus de l’Académie des Sciences, Paris, Tome 195 (1932), pp. 98-100. 
M. Gevrey, Systémes d’ équations aux dérivées partielles du type parabolique, ibid., Tome 
195 (1932), pp. 690-693. 
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probléme. On a évidemment x2, , Xm, y) =b(%X1, %2, 
il nous reste 4 prouver que U satisfait 4 l’équation (1) partout dans R. 
Il suffit évidemment de démontrer que ceci a lieu au point P. Or soit 
ii(x1, X2,° °°, Xm, y) la solution de (1) déterminée dans p, égale 4 ¢ 
pour y=0 et identique 4 U sur la surface latérale o de p. Nous allons 
voir que #= U dans p. 

En effet, e>0 étant un nombre arbitrairement petit, on peut dé- 
terminer le nombre de sorte que 


(0) (0) (0) (0) (0) (0) (0) (0) 
| %2 ) U(x »%2 )| <«/2, 


| Up(X1, X2,°°* Xm, y) U(x, Xm, y) | < sur ¢o. 
Il résulte de la derniére inégalité, que 


(0) (0) (0) (0) _, (©) (0) (0) 
puisque u,—@# est dans p une solution de (1) et on a u,=%#% pour 
y=0 et #=U sur o. On en déduit |a(xO, x, y) 
— U(x, x, y)| <e donc 
_, @) (0) (0) (0) (0) (0) (0) 
Ainsi U est une intégrale de (1) dans R, c’est donc la solution de 
notre probléme. 
Il nous reste encore 4 démontrer qu’elle est de la classe E2 par rap- 
port aux x,. Or la fonction v*=lim,..,v;* satisfait 4 l’inégalité 
|v*| <M. Comme d’autre part U=v*-H (x1, x2,--+, Xm, y; K), ona 


K 
| U(x1, X2,° °°, Xm; y) | < M exp [A= ry] 


Si la fonction @ est de la classe Ei, on démontrerait aisément, qu’il 
en est de méme de U (par rapport aux x;); plus précisemment, 
l'inégalité |¢| <M exp entraine 


|u| <Mexp x|( |, 


I étant une fonction bornée. En particulier l’inégalité |¢| <M en- 
traine | U| <M dans R. 


WILno, PoLanD 


— 
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ON APPROXIMATION BY EUCLIDEAN AND 
NON-EUCLIDEAN TRANSLATIONS OF 
AN ANALYTIC FUNCTION 


W. SEIDEL AND J. L. WALSH 


In 1929 G. D. Birkhoff established! the noteworthy result that an 
entire function F(z) exists such that to an arbitrary entire function 


g(z) corresponds a sequence a, d2,--- depending on g(z) with the 
property 
(1) lim F(z + an) = g(2) 


for all z, uniformly for z on every closed bounded set. 

It is the object of the present note (a) to indicate that not merely 
an arbitrary entire function g(z) can be expressed in the form (1), but 
also any function analytic in a simply connected region, and (b) to 
study the non-euclidean analogue of the entire problem; precisely 
analogous results are obtained. Some related topics under (a) have 
recently been studied by A. Roth,? who, however, does not mention 
the results to be proved here. 

The immediate occasion of the interest of the present writers? in the 
problem is through (b), for non-euclidean translations have been 
widely used in the study of derivatives of univalent and other func- 
tions analytic in the unit circle |z| =1; limit functions under such 
translations are of great significance in the study of derivatives and 
of limit values of a given function as a variable point z approaches the 
circumference |z| =1. 

We shall give a proof of the following theorem, proof and theorem 
differing only in detail from those of Birkhoff: 


THEOREM 1. There exists an entire function F(z) such that given an 
arbitrary function f(z) analytic in a simply connected region R of the 
z-plane, we have for suitably chosen ai, d2,--- the relation 


(2) lim F(z + an) = f(2) 


for z in R, uniformly on any closed bounded set in R. 


1 Comptes Rendus de I’Académie des Sciences, Paris, vol. 189, pp. 473-475. 

2 Comentarii Mathematici Helvetici, vol. 11 (1938-1939), pp. 77-125. 

% Compare Seidel and Walsh, On the derivatives of functions analytic in the unit circle 
and their radii of univalence and of p-valence, a forthcoming paper in the Transactions 
of this Society. 
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Here and throughout the sequel we are concerned with the finite 
plane, that is to say, the plane of finite points z. 
By way of geometric entities, we introduce the circles Ci: | z—4| =? 


C2: =22,---, C,: |z—4"|=2",---, and also the circles 
r,: |z| =4"+2*+1; it follows that the C, are mutually exterior, 
and that I, contains in its interior all the circles Ci, C2,---, Cy 


but no point in or on any of the circles C41, Cas2, - - - 

Let us enumerate the polynomials in z with rational coefficients: 
pi(z), p2(z), - - - . It is of course true that any sequence of polynomials 
can be replaced by a sequence of polynomials with rational coeffi- 
cients, without altering whatever properties may exist of convergence 
or uniform convergence to a given function on bounded point sets. 

We choose 7(z) as a polynomial in z which satisfies the inequality 
| p1(2—4) —m(z)| <1/2, z on or within C,; indeed we may choose 
™1(z) =p1(2—4). We choose 72(z) as a polynomial in z which satisfies 
the two inequalities 

| m1(z) — m2(z) | < 1/4, z on or within 1, 

| po(z — 4?) — mo(z) | < 1/4, z on or within C2; 

such a polynomial 72(z) exists, by Runge’s classical theorem. In gen- 
eral, let 1,(z) be a polynomial in z which satisfies the inequalities 

| — wa(z)| < 1/2", on or within 

| pa(z — 4") — n(z) | < 1/2", zon or within C,. 

The sequence {7,(z)} converges uniformly in each of the circles I'n; 
hence converges at every point of the plane, uniformly on any 
bounded set. The limit function F(z) is entire, and has the required 


properties. Indeed, let f(z) be analytic in a simply connected region R; 
there exist polynomials p,,(z) of the set already defined with 


(3) lim pas(2) = f(2) 


at every point of R, uniformly on any closed bounded set in R. For z 
in C,: |z—4>| <2" we have 
F(z) = a(z) + [an41(2) — 
+ [n+2(2) n+41(2) | 
| F@) — pale — 4")| S| pale — 4") — | +| — | 
+ | Tn42(2) — n+1(2) | 
1 1 1 
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whence 


(4) lim [F(z + 4") — pa(z)] = 0 

2 
for every 2, uniformly on any bounded set. To return to f(z), we now 
have from (3) and (4) 
(5) lim [F(z + 4%) — f(z)] =0 

for z in R, uniformly on any closed bounded set in R. Theorem 1 is 
established. 

The special case of Theorem 1 that f(z) is an entire function and R 
is the (finite) z-plane is included here, and is the case considered by 
Birkhoff. We add the remark that whenever a function g(z) can be 
represented on a point set E (bounded or unbounded) by a sequence 
of polynomials, that function can also be represented on E in the man- 
ner indicated by (2), with preservation of the property of uniform 
convergence whenever that occurs on a bounded set belonging to E. 
For instance, E may consist of a sequence of disjoint simply connected 
regions Ri, Re, ---, with g(z) analytic on E; then g(z) can be repre- 
sented on E either by a sequence of polynomials or, as in (2), with 
uniform convergence on any closed bounded subset of E. On the gen- 
eral subject of representation by polynomials there exist modern re- 
searches due to Montel, Walsh, Hartogs and Rosenthal, and Lavren- 
tieff.4 

A further remark in connection with Theorem 1 is that if the num- 


bers Ao, A1, A2,--- are arbitrary, there exists a sequence a1, a2,--- 
with the property 
(6) lim F“(a,yY = Ax, k = 0, 1, 2,- 


To establish (6) it is sufficient to remark that when m is given, the 
number a,, exists with the property 


Ae | 1 
2! m! 2™-m! 


for |z| < 1; 


F(z + Gu) — E + 


from Cauchy’s inequality it then follows that we have | F“ (am) —Az| 
<1/2",k=0,1,2,---,m; the relation (6) follows. 

‘ The reader may refer to Lavrentieff, Sur les Fonctions d'une Variable Complexe 
Représentables par des Séries de Polynomes, Actualités Scientifiques et Industrielles, 
no. 441, Paris, 1936. 
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We turn now to the non-euclidean analogue of Theorem 1: 


THEOREM 2. There exists a function ®(z) analytic in the region 
| 2| <1 such that given an arbitrary function $(z) analytic in a simply 


connected subregion R, we have for suitably chosen oy, a2,--- the rela- 
tion 
2+ a, 
7 lim @{ — = 3), 


for 2 in R, uniformly on any closed set interior to R. 


As in Theorem 1 we needed to use only real a,, so here we shall 
actually employ only real a,. 

For geometric entities we choose here (C; as the n.e. circle of n.e. 
radius 2 whose n.e. center is the point z=; of the axis of reals whose 
n.e. distance from z=0 is 4, and in general choose C, as the n.e. circle 
of n.e. radius 2" whose n.e. center is the point z=, of the axis of reals 
whose n.e. distance from z=0 is 4”. Let T’,, be the circle whose center 
is z=0 and n.e. radius 4*+2"+41, so that I, contains in its in- 


terior all the circles Ci, C2,---, C,, but no point in or on any of 
the circles Casi, Caso, - 
As before, we use the polynomials p:(z), p2(2),--- with rational 


coefficients. Choose 7:(z) as a polynomial in z which satisfies the in- 
equality 


( z— fy ) | 
pil — _ ] | < 1/2, z on or within C;; 
1 — Biz 


choose 72(z) as a polynomial in z which satisfies the two inequalities 
| — | < 1/4, z on or within Tj, 

— 
— Boz me) 


In general, let 7,,(z) be a polynomial in z which satisfies 


< 1/4, z on or within C2. 


| — | < 1/2", z on or within I',_1; 


| Pn (--*) — | <1/2", zon orwithin C,. 
1 — Biz 

The sequence {z,(z)} converges uniformly in each of the circles In, 
hence converges at every point of the region | s| <1, uniformly on any 
closed subset. The limit function ®(z) is analytic throughout the re- 
gion |z| <1, and will now be shown to have the required properties. 
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For z in C, we have 
+ — +---, 


| _| s— 


+ | — | 4... 


1 1 
whence 
z+ Bn 
8 li } — Pn =0 


for every z in | z| <1, uniformly on any closed set in | z| <I; 
Let ¢(z) be analytic in the simply connected region R in | z| <8 
There exist polynomials p,,(z) of the set already defined with 
(9) lim pn,(2) = $(2) 
ny 
at every point of R, uniformly on any closed subset. From (8) and (9) 
we find 


(10) lim ~ | =0 
ne 1 + 
for zin R, uniformly on any closed set in R. 
Theorem 2 is established. The region R may in particular be the 
region | z| <1. Remarks for Theorem 2 entirely analogous to those for 
Theorem 1 are also valid. 


UNIVERSITY OF ROCHESTER AND 
HARVARD UNIVERSITY 


ZERO-DIMENSIONAL FAMILIES OF SETS! 
SAMUEL EILENBERG AND E. W. MILLER 


A family = { A,} of subsets of a topological space X will be called 
0-dimensional if given an open set U such that A,,CU, there is an 
open set V such that (1) Aa,CVCU and (2) (V—V)>\.A2=0. We 
enumerate below a few of the most common 0-dimensional families. 
In each case the proof of 0-dimensionality is easy, and is therefore 
omitted. 

(I) Every family of disjoint open subsets of a topological space is 
0-dimensional. 

(I1) Let Y be a locally connected subset of a topological space X. 
The family ® of the components of Y is 0-dimensional. 

(III) Let Y be a compact and closed subset of a metric space X. 
The family ® of the components of Y is 0-dimensional. 

(IV) Let Y bea subset of a metric space X. The family ® consisting 
of the individual points of Y is 0-dimensional if and only if dim Y=0. 

(V) Let ® be a family of closed subsets of a compact metric space 
X. If, given any sequence F, F;, F2,--- of sets of ®, the relation 
F-lim inf F;#0 implies lim inf F;C F, then the family ® is called up- 
per-semi-continuous. In this case the sets of the family ® are disjoint. 
There is a standard way of introducing a topology into the family ® 
which leads to a separable metrizable hyperspace $*. The family @ is 
0-dimensional if and only if dim #*=0. In particular, ® is 0-dimen- 
sional whenever it is upper-semi-continuous and countable. 

(VI) Let Y be a subset of a topological space X and let Y be 
homeomorphic with a subset of the linear continuum. The family ® 
of the components of Y is 0-dimensional. 

The purpose of this note is to establish the following theorem: 


THEoREM. Let X be a unicoherent Peano continuum,? 6={A,} a 
0-dimensional family of subsets of X, and x; and x2 two points of X. 
If none of the sets A, cuts X between x, and x2,? then ) «Aq does not cut X 
between x, and x2. 


Various corollaries can be obtained by taking X to be the m-sphere 


1 Presented to the Society, December 26, 1939, under the title On 0-dimensional 
upper-semi-continuous collections. 

2A Peano continuum X is called unicoherent if given any decomposition 
X =Xi+X; into continua, the set X,-X2 is a continuum. 

3A set ACX cuts X between x; and x2 if X —A contains no continuum joining 
and x2. 
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S" (n>1) or the n-cube Q* (n>0) and ® to be one of the families 
(I)-(VI).4 

We shall establish two lemmas before giving the proof of the theo- 
rem. 


Lemma 1. Let X be a unicoherent Peano continuum. Let A, and Az be 
open and disjoint subsets of X, and let Az be connected. If neither A, 
nor Az cuts X between the points x; and x2, then A,+Az does not cut X 
between x; and x2. 


Proor. Let C2 be the component of X — Az which contains x, and xe. 
Let and —C,. It follows easily that B,; and are 
open, B,-B.=0, and X—B,=C, are connected, and 
CB,+Bz. It will be sufficient to show that B, +B, does not cut X 
between x; and xe. 

Since B,CAy,, the set B,; does not cut X between x; and x2. We will 
denote the component of X —B, which contains x, and x2 by C.. If 
C,-Bz=0, then —(B,+B:2), so that +B. does not cut X be- 
tween x; and x2. We will suppose then that C,-Be2¥0. Since C; 
and Be. are connected it follows that Ci+Bs is connected. But 
C:+B.CX—B;. Hence (;+BeCC;, and therefore Since 
C2=X — Bo, it follows that X =C,+ C2. Since and C2 are continua 
and X is unicoherent, it follows that C,-C: is a continuum. But x, 
and x2 belong to Ci- C2, and X Hence B, +B: does 
not cut X between x; and xe. 


LemMA 2. Let X be a unicoherent Peano continuum and let 
A;, Ao, -++,An,*** bea sequence of disjoint open subsets of X. If 
none of the sets A, cuts X between x; and x2, then 5 i n does not cut X 
between x; and x». 


Proor. Since X is locally connected and separable, every open set 
in X consists of a countable number of components each of which is 
open. It is clear, then, that we may assume each set A, to be con- 
nected. 

Let k be any positive integer. Using Lemma 1, it follows by finite 
induction that --~- +A; does not cut X between x; and 
Let C; be the component of X —(A1+A2+ --- +A;) which contains 
x, and x2. Then []#.,C; is a continuum containing x; and x2, and 
Ar. Hence does not cut X between x, 
and xe. 


4 See R. L. Moore, Proceedings of the National Academy of Sciences, vol. 10 
(1934), p. 356, and S. Eilenberg, Fundamenta Mathematicae, vol. 26 (1936), pp. 76- 
77. 
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We now return to the proof of the theorem: 

For any F in © there is an open set U(F) such that FC U(F), and 
U(F) does not cut X between x; and x2. Since ® is 0-dimensional, there 
is an open set V(F) such that FC V(F)C U(F) and [V(F)— V(F) |>-F 
=0. By the Lindeléf covering theorem, there is a sequence F;, Fs, - - - 
of elements of ® such that > FC>>2.,V(F;). Now let 


A, = V(F)), As = V(F2) V(F)), 
A, = V(Fi) — +--- 


The sets Ai, Az,---, Ax, are open and disjoint, and no one of 
them cuts X between x; and x2. But, as is easily shown, FC) 
Hence in view of Lemma 2, >, F does not cut X between x; and x». 


THE UNIVERSITY OF MICHIGAN 


SUMS OF FOURTH POWERS OF GAUSSIAN INTEGERS 
IVAN NIVEN 


It is the purpose of this note to give necessary and sufficient condi- 
tions for the expressibility of a Gaussian integer as a sum of fourth 
powers of Gaussian integers; and then to determine an upper bound 
to the number of fourth powers necessary when the conditions are 
satisfied. Our results are as follows: 


THEOREM. A Gaussian integer is expressible as a sum of fourth powers 
of Gaussian integers if and only if its imaginary coordinate is divisible 
by 24. Every integer a+24b1, where a and b are rational integers, is ex- 
pressible as a sum of 18 or fewer fourth powers. 


First we prove that the condition is necessary. We note that! 
(1) (x + yi)* = xt — 6x?y? + y* + 4ixy(x? — 


It is obvious that xy(x?—y?) is divisible by 2 and by 3. Hence any 
fourth power has an imaginary coordinate divisible by 24, and any 
sum of fourth powers has the same property. 

The converse of this is included in the second statement in the 
theorem, which we now proceed to prove. Th» author? has shown 
that a Gaussian integer a+ 207 is expressible as a sum of two squares 


1 Latin letters will represent rational integers throughout this paper. 
2 Integers of quadratic fields as sums of squares, Transactions of this Society, 
vol. 48 (1940), p. 410, Theorem 2. 
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if and only if not both a/2 and 3b are odd rational integers. We use 
this result in the following two lemmas. 


Lemma 1. The Gaussian integer 6(c+-di)?, where c and d are odd and 
even, respectively, is expressible as a sum of six fourth powers. 


By the theorem stated above, we can write c-+di=a?+?, where a 
and 8 are Gaussian integers. Also we employ the identity 


6(a? + B*)? = 2(a + B)* + 2(a — B)* + (a + Bi)* + (a — Bi)’, 
which completes the proof. 


LEMMA 2. A Gaussian integer of the form 48h+-6k+24mi, where k 
equals 2 or 6 according as m is even or odd, ts expressible as a sum of 
twelve fourth powers. 


As above, we can write 


(2) 8h + k + 4mi = (c + di)? + (e + fi)’, 
which implies 
(3) — — f*, 2m = cd + ef. 


Since 8h+k=2 (mod 4), the first of equations (3) shows that either 
c and e are odd and d and f are even, or vice versa. We show that the 
latter cannot be the case. 

First let m be even, so that k=2. Then equations (3) imply the 
congruences 


(4) c? + e? — d? — f? = 2 (mod 8), cd + ef = 0 (mod 4). 


Since the square of an odd integer is congruent to 1 modulo 8, the as- 
sumption that d and f are odd leads to the congruence c?+e?=4 
(mod 8). Hence the integers c and e are even, but are incongruent 
modulo 4. But these conditions on c and e are incompatible with the 
second of the congruences (4). 

On the other hand, if m is odd, k=6, and corresponding to (4) we 
have the congruences 


(5) c? + e? — d? — f? = 6 (mod 8), cd + ef = 2 (mod 4). 


Assuming again that d and f are odd, we obtain c?+e?=0 (mod 8). 
Hence c and e are even, and, moreover, are congruent modulo 4. These 
conditions are such that the second of congruences (5) has no solu- 
tions. 

Thus we have proven that d and f are even, and c and e are odd. 
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Multiplying equation (2) by 6, we see that Lemma 1 is applicable to 
each term on the right side, and we have the desired result. 
To complete the proof of the theorem, we prove this lemma: 


LEMMA 3. Any integer a+24bi is expressible as a sum of six fourth 
powers and an integer of the type described in Lemma 2. 


We identify 6 with m, and show that a is congruent to six real 
fourth powers modulo 484+-6k. This we do by exhibiting the numbers 
0,1, ---,47 as sums of fourth powers modulo 48. Most of these can 
be handled by the use of: 14=1, 2=16, (1+7)4= —4. Except for the 
values 7, 22, 23, 27, 37, 38, 39, 42, 43, and 47, all integers from 1 to 47 
can be expressed as sums of 1, 16, and —4, not more than six sum- 
mands being used in each case. For example, we have 


11=16-—4—4+1+1+1, 4=164+ 16+ 16—44-1+1. 


Turning now to the exceptional cases, we make use of 81 = 34, and 
introduce congruences modulo 48. We can write 


7= 151 = 81+ 81-—4-4-—-4+1, 
22 =70 = 
27 = 75 = 
37 
47m —-1=2 —44+1+1+1, 


these being congruences modulo 48. Since the integer 37 is represented 
here as a sum of four fourth powers modulo 48, the integers 38 and 39 
are similarly sums of five and six fourth powers, respectively. Also 
the integers 23 and 43 can be compared with 22 and 42 above. 
Although the theorem has been proved completely, we now show 
that Lemma 3 cannot be improved, that is, that six fourth powers are 
necessary in at least one case. Consider the situation when a = 19+48A, 
b=2B. In this case we can show that it is not possible to obtain ra- 
tional integers h and m, and Gaussian integers a, - - - , as to satisfy 


5 
(6) a + 24bi = Do a; + 48h + 6k + 24mi, 
t=1 


where, of course, & is 2 or 6 according as m is even or odd. 


Lemma 4. If the imaginary part of the fourth power of a Gaussian 
integer ts congruent to 0 modulo 48, the real part is congruent to 0, 1, or 12 


= 
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modulo 16. If the imaginary part ts congruent to 24 modulo 48, the real 
part is congruent to 9 modulo 16. 


From equation (1) it follows that 
R(x + yi)* = (x? — y*)? — I(x + yi)4 = 4xy(x? — y?). 


Since the square of a rational integer is congruent to 0, 1, 4 or 9 
modulo 16, it is easily verified that R(x+-yi)‘ is congruent to 0, 1, 9, 
or 12 modulo 16. Also note that R(x +i)‘ is congruent to 9 modulo 16 
if and only if x is odd and y is congruent to 2 modulo 4, or vice versa; 
and when x and y satisfy these conditions, I(x+-yi)‘ is congruent to 
24 modulo 48. In all other cases either xy or x?—y? is divisible by 4, 
and hence I(x+yi)‘ is congruent to 0 modulo 48. 

We now use Lemma 4 to prove equation (6) impossible. First let m 
be even, so that k=2. Since D is even the imaginary parts of (6) can 
be equal only if }va# has an imaginary coordinate congruent to 0 
modulo 48. This implies that an even number of the terms I(a#) are 
congruent to 24 modulo 48. In turn, Lemma 4 states that an even 
number of the terms R(a#) are congruent to 9 modulo 16. The real 
parts of equation (6) can be written as a congruence: 


5 
19 = >> R(ax) + 12 (mod 16). 
i=1 

Simple verification shows that this congruence cannot be satisfied by 
assigning the values 0, 1, 9, or 12 to the terms R(af), the value 9 being 
used an even number of times. 

Second let m be odd, so that k =6. Corresponding to the above con- 
gruence we have 


5 
19 = >> R(a;) + 36 (mod 16). 
i=1 


In this case, however, Lemma 4 requires that we have an odd number 
of the terms R(af) congruent to 9 modulo 16. Again it can be verified 
that the congruence has no solutions satisfying this condition. 
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THE GEOMETRY OF WHIRLS AND WHIRL-MOTIONS 
IN SPACE! 


J. M. FELD 


1. Introduction. The geometry of whirls and whirl-motions in the 
plane was inaugurated by E. Kasner.? An adaptation of Kasner’s ge- 
ometry to the sphere was made by K. Strubecker.? It is the purpose 
of this paper to develop a strictly analogous geometry in euclidean 
three-space, S;. To render such a development possible we shall in- 
troduce a new type of oriented plane element—namely, a geometric 
object formed by a plane, a point in the plane, and an ordered pair 
of orthogonal fundamental directions in the plane; the fundamental 
directions shall be given by a pair of unit vectors in the plane. There 
are 2«® such plane elements in $3; there are ©* of the Lie kind. 
Henceforth, plane element shall mean only the new kind of plane ele- 
ment.4 


2. Turns, slides, and direct whirls. Let ¢o, ¢:, ¢2, es be the Hamil- 
tonian quaternion units such that 


2 3 2 
Cots = Cito = Ci, C1 = Co = C3 = C1€oe3 = — 1. 


Let any real point P in S; whose orthogonal cartesian coordinates are 
21, 2, 23 be represented by the position vector z=21¢:+22¢2+23¢3. Let 


1 Presented to the Society February, 22, 1941. 

2? Edward Kasner, The group of turns and slides and the geometry of turbines, 
American Journal of Mathematics, vol. 33 (1911), pp. 193-202. Further development 
of the subject appears in a series of papers by Kasner and De Cicco; see, for example, 
their joint papers, Quadratic fields in the geometry of the whirl-motion group Gg, ibid., 
vol. 61 (1939), pp. 131-142; and The geometry of the whirl-motion group Gg: elementary 
invariants, this Bulletin, vol. 44 (1938), pp. 399-403. 

3 K. Strubecker, Zur Geometrie spharischer Kurvenscharen, Jahresbericht der Deut- 
schen Mathematiker-Vereinigung, vol. 44 (1934), pp. 184-198. 

4 Inasmuch as our oriented plane element defines a position of a rigid body in space, 
it is essentially equivalent to Study’s soma, Geometrie der Dynamen, Leipizg, 1903, 
and to De Saussure’s feuillet, Exposé Résumé de la Géométrie des Feutllets, Geneva, 
1910; see also R. Bricard, Nouvelles Annales de Mathématiques, (4), vol. 10 (1910). 
It was remarked by Kasner in his 1911 paper, loc. cit., that it was possible to obtain 
in any space of constant curvature a group analogous to his group of whirls in the 
plane and that, moreover, for ordinary space the feutllet, consisting of a point, line, 
and plane all incident with one another, would be an appropriate element. Another 
generalization of Kasner’s turbine geometry, along lines different from those pursued 
in this paper, has been carried out by A. Narasinga Rao, Studies in turbine geometry 
I, Journal of the Indian Mathematical Society, vol. 3 (1938), pp. 96-108; II, Pro- 
ceedings of the Indian Academy of Sciences, vol. 8A (1938), pp. 179-186. 
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OR be an axis through the origin parallel to a given unit vector 2, 
and let 


= —cos0/2+vsin0/2, = —cos0/2 —vsin 6/2. 


Then the rotation of z around OR through the angle @ is given by 
Hamilton’s formula® 


(2.1) = f2f. 


We shall refer to (2.1) as the rotation ¢. 

Let € be a plane element characterized by the point z lying in the 
plane P and the pair of fundamental orthogonal directions given by 
the unit vectors 7, v2, in this order. Let ¢ be the (unit) quaternion by 
means of which the vectors ¢: and ¢, can be rotated around an axis 
passing through the origin into positions parallel respectively to the 
directions of 7; and ve. There evidently exists a (1, 1) correspondence 
between the plane elements € and the pairs of quantities z, ¢; we 
shall, accordingly, call z, § the coordinates of €, and shall use the sym- 
bol (z, ¢) to designate the plane element whose coordinates are 2, ¢. 


DEFINITIONS. A slide is a plane element transformation, (z, £) 
—>(z*, ¢*), which leaves the planes of the plane elements unaltered, but 
subjects their points to the same translation relative to their fundamental 
directions. 

A turn is a plane element transformation which leaves the points of 
the plane elements unaltered, but rotates their planes through the same 
angle around an axis inclined in the same manner relative to their funda- 
mental directions. 

A direct whirl® is a plane element transformation resulting from the 
product of an arbitrary turn followed first by an arbitrary slide and then 
by a second arbitrary turn. 


These definitions are the space analogues of those given by Kasner 
for the plane. 

Let +202, (a; real); then the slide (z, ¢)—>(z*, ¢*), which 
shall be represented by S,, is given by the equations: 


5 It will suffice for our purpose to restrict ourselves throughout this paper to unit 
quaternions—that is, those having a norm equal to unity. 

6 The term direct whirl is used here instead of Kasner’s whirl because it is necessary 
to distinguish this from other types of whirl that appear below. This term was first 
used, for a similar reason, in J. M. Feld’s Whirl-similitudes, euclidean kinematics, and 
non-euclidean geometry. (This Bulletin, abstract 46-5-270.) 
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Let a=apeo+aititarte+aze3 and N(a)=1; then the turn T, is 
given by the equations: 


(2.3) =z, = af. 
The direct whirl T,S.7¢ is given by the equations: 
= faaat, = Bat. 


If we let aaa =c and Ba =¥, the equations of a generic direct whirl W, 
assume the form: 


(2.4) s=stic, f*=r7t 


where c is a vector and ¥ is a unit quaternion. 
We can now state the following theorems: 


THEOREM 1. The slides constitute a continuous two-parameter abelian 
group such that Sa- Sp=Sa+s. 


THEOREM 2. The turns constitute a continuous three-parameter group 
such that 


THEOREM 3. The direct whirls constitute a continuous six-parameter 
group. A necessary and sufficient condition that Wig is that 
c=a+aba and y=Ba. 


We shall let Wf represent the group of direct whirls. Noting that 
the direct whirls Wi. where a= ¢o, a=ke; (k real), form a one-param- 
eter continuous group of contact transformations—namely, the group 
of dilatations D,—we obtain the following theorem: 


THEOREM 4. Direct whirls can be uniquely represented in the form 
We, where c=kes+b, y =a. 

From equations (2.4) we obtain, by multiplying each member of 
the first equation on the left by the corresponding member of the 
second equation and on the right by the conjugate of the correspond- 
ing member of the second equation, {*2*f*=y(f2t)7y+ycy. Letting 
Z* ={*2*t*, a=ycy, we have 


(2.5) Z* = aZa+a, 


which represents a euclidean point displacement Z—Z* in S;. Let 
this displacement be represented by Df,. Then we obtain this 
theorem: 


THEOREM 5. The group of direct whirls is simply isomorphic to the 
group of euclidean displacements in S3: the correspondence between mem- 


bers of these groups is given by where b=aaea and B=a. 
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3. The group of whirl-motions. Equation (2.5), which represents a 
point transformation, can be extended to represent a plane element 
transformation (z, ¢)—>(z*, ¢*) as follows: 


(3.4) 2* = aza + a, = te. 


Henceforth, Dy, shall designate the euclidean displacement of plane 
elements given by (3.1); Dé shall designate the six-parameter group 
of these displacements. The product of a displacement Dy, and a di- 
rect whirl W;f,, in either order, shall be called a direct whirl-motion. 
The equations of the direct whirl-motion D?,-W¢, are: 


(3.2) = a(z + +a, = "Bro: 


Evidently Di,-Wiis=Wig-Di.. The direct whirl-motions consti- 
tute a continuous twelve-parameter group, which shall be designated 
by Gi. 

Let / and r be an arbitrary pair of vectors, and let the plane ele- 
ment be subjected to the possible rotations around 
the point whose position vector is equal to ]; then the manifold of «# 
plane elements (z, ¢) into which € is thereby transformed shall be 
called a (space) turbine. The equation of this turbine is 


(3.3) z—1= fre. 


Since the vectors / and r completely characterize the turbine (3.3), 
we shall call / and r its left and right coordinates respectively, and let 
the symbol [/, r] represent the turbine whose equation is (3.3). A 
brief computation will verify this theorem: 


THEOREM 6. Turbines are transformed into turbines under @},. Under 
W,t their left coordinates remain invariant; under D¢ thetr right coordi- 
nates remain invariant. 


Let us now consider a space analogue of the flat field, a concept that 
appears in Kasner’s whirl-motion geometry in the plane. We shall call 
this analogue a dual element because it will serve as the dual of the 
plane element, just as the flat field serves as the dual of the lineal 
element.’ Dual elements shall be represented by two coordinates s 
and o where the former is a vector and the latter is a unit quaternion; 
in order to distinguish dual elements from plane elements, the former 
shall be represented by the symbol {s, ¢}. We define dual elements 
by imposing upon them the requirement that the turbine [/, r] which, 
in plane element coordinates z, ¢, has the equation (3.3) shall, in dual 
coordinates s, ¢, have the equation 


7 Feld, loc. cit. 


E 
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(3.3*) s—l= are. 


A turbine [/, r] and a plane element (z, ¢) shall be said to be incident 
if their coordinates are connected by equation (3.3); similarly, [1, r] 
and {s, a} shall be said to be incident if their coordinates are con- 
nected by equation (3.3*). 

It is evident that direct whirls and displacements of plane elements 
induce contragredient transformations of the dual elements; for ex- 
ample, the dual element transformation contragredient to Wi, is 


(3.4) s* = 5 — gage, o* = ao. 


This can be regarded, of course, as a representation of Wi, in dual 
element coordinates; likewise, displacements and whirl-motions can 
be expressed in dual element coordinates. 

Let $1 represent the involutory plane element transformation 


(3.5) =f, 
which induces the dual element transformation 
(3.5*) st=—s5, ot =o. 


The ~* transformations 3:8¢ or WF: shall be called opposite whirls; 
the family of opposite whirls shall be designated by We. Similarly, 
the family $:D¢=D#3:, which embraces the * euclidean symme- 
tries in S;, shall be designated by Df; and the twelve-parameter 
family yielded by the products of $1 and the direct whirl-motions 
shall be called the family of opposite whirl-motions, and shall be repre- 
sented by Gj,. Evidently, Dr We =DF We = G},. Since the product of 
two opposite whirl-motions is a direct whirl-motion, the two families 
@}, and G?, form a mixed group. 

Let $2 represent the involutory correlation (z, Hof{s, a} given by 
the equations: 


(3.6) 's, 

Let the family of transformations embraced by $2G}, = Gi,%2 be repre- 

sented by G3; likewise, let $2G?, = G?,32 be represented by Gf. The 

four families Gj, (¢=1, 2, 3, 4), constitute a mixed group, which shall 

be designated by I» and called the group of proper whirl-motions. 
Let $3 represent the involutory plane element transformation 

(z, £)—>(z*, ¢*) given by the equations: 

(3.7) fet, =F. 


This transformation induces the dual element transformation 


(3.77) s* = — o* = 
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Let the products $;@{,=Gi,%3, (c=1, 2, 3, 4), be designated by $jp. 
The four families S{, embrace the improper whirl-motions. The proper 
and improper whirl-motions constitute a mixed twelve-parameter 
group composed of eight mutually exclusive families; this group, 
which shall be designated by Tz, shall be called the complete group 
of whirl-motions. The following theorems are now evident. 


THEOREM 7. Turbines are transformed into turbines under the com- 
plete group of whirl-motions. 


THEOREM 8. The turbine transformation effected by 33 is given by 
=]. 


4. The kinematic representation. Let the turbine [/, r] be mapped 
on that ordered pair of points in S; whose position vectors are / and r, 
in this order. By means of this mapping, which shall be known as the 
kinematic representation,® a (1, 1) correspondence is set up between 
turbines [/, r] and ordered point pairs /, r, the former of which is the 
left image point and the latter the right image point cf [/, r]. If we 
regard, as we may, the 81? whirl-motions in Ty as turbine trans- 
formations, [/, r]—[/*, r*], we find that the kinematic representation 
maps these transformations (1, 1) upon pairs of euclidean motions 
(displacements and symmetries) in $3, which are either of the type 
l—l*, r—r*, or else of the type ]—r*, r—/*. The pairs of motions of 
the former type correspond to the proper whirl-motions, and those 
of the latter type to the improper whirl-motions. Explicitly, the kine- 
matic images of the families Gj, are as follows: 


Gre: = ala+a, r* = BrB + 
(4.1) Gis: = —a@la+a, r* = — BrB +b, 
Gi: I* = alata, * = — BrB +5, 


= 
a and B are unit quaternions; a and b are vectors. 


To obtain the kinematic image of the family ${, we need but, in ac- 


8 A similar representation of whirl-motion geometry in the plane has been given 
by Feld, loc. cit. See also, in this connection, J. Griinwald’s Ein Abbildungsprinzip, 
welches die ebene Geometrie und Kinematik mit der réumlichen Geometrie verkniipft, 
Sitzungsberichte der Akademie der Wissenschaften, Vienna, Mathematisch-Natur- 
wissenschaftliche Klasse (II A), vol. 80 (1911), pp. 677-741; and two contributions by 
W. Blaschke, Euklidische Kinematik und nichteuklidische Geometrie, Zeitschrift fiir 
Mathematik und Physik, vol. 60 (1911), pp. 61-91, 203-204; and Ebene Kinematik, 
Berlin, 1938. 
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cordance with Theorem 8, interchange / and r in the pair of equations 
given by (4.1) which corresponds to @¥,. 

Inasmuch as the turbine [/, r] has for its equation in plane element 
coordinates z, ¢: = —{r{+z, and for its equation in dual element co- 
ordinates s,¢:1=ara0-+s, it is apparent that a necessary and sufficient 
condition that [/, r] be incident to a given plane element (z, ¢) [dual 
element {s,}] is that the left and right image points of [/, r] in the 
kinematic representation be corresponding points in a euclidean sym- 
metry [displacement]. Consequently, the kinematic representation 
establishes a (1, 1) correspondence between plane elements [dual ele- 
ments] and euclidean symmetries [displacements] in 53. 

The kinematic mapping of plane elements and dual elements upon 
symmetries and displacements, respectively, furnishes us with the 
means of representing parametrically the composition of euclidean 
motions in $3. An example will suffice to show how this can be done. 
Let the coordinates of the dual element {s, 7} represent the displace- 
ment that corresponds to {s, 7} in the kinematic representation; let 
the displacement corresponding to {s3, 03} be equivalent to the prod- 
uct of the displacements corresponding to {5,01} and {se, o2}, and 
in this order. With these conventions it follows that 


$3 = 25102 + Se, = 0102; 


in a similar manner we can represent other combinations of euclidean 
motions. 


5. The group of whirl-similitudes. Let S represent the pair of simili- 
tudes: 


Pah r* = kr 


where k is a positive real number, and /, r are the kinematic image 
points of [/, r]. Let Gi,=SGG{, and H{,=SHj,. Then the eight thir- 
teen-parameter families comprising Gj, and Hj3, (¢=1, 2, 3, 4), con- 
stitute a mixed group l\;—the complete group of whirl-similitudes. This 
group is the space analogue of the group of whirl-similitudes in the 
plane.” Evidently, turbines are transformed into turbines under Ts. 


BROOKLYN COLLEGE 


® E. Study has given such a representation by means of Clifford biquaternions, 
op. cit. 
10 Feld, op. cit. 
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A THEOREM CONCERNING THE GEODESICS ON 
A PARABOLOID OF REVOLUTION! 


DONALD LING AND LEON RECHT 


We shall examine the behavior in the large of the geodesics on a 
paraboloid of revolution. It will appear that on every geodesic there 
is a unique point—the “point of symmetry”—which divides the geo- 
desic into two “conjugate rays,” which are mirror images of each 
other in the plane determined by that point and the axis of the parab- 
oloid. Every geodesic except the meridians, which have no singu- 
larities whatever, has infinitely many double points, but no other 
singular points. If, starting at the point of symmetry, we imagine 
the conjugate rays of a geodesic to be traversed simultaneously by two 
points which. are images in the plane of symmetry, and which con- 
tinue until they meet for the first time—at the “first double point” 
of the geodesic—we shall have generated a “loop of type 1.” If, start- 
ing at the point of symmetry, they pass through the first double 
point and continue until they next meet—at the “second double 
point”—we shall have formed a “loop of type 2.” We define in this 
way loops of types 1, 2, 3, -- - . The last double point to be reached 
in the process of generating a loop of a given type will be called the 
“vertex” of the loop. 

The main result of this paper is the following: The paraboloid can 
be divided by planes perpendicular to the axis into infinitely many 
zones, each zone to include the points of that bounding circle which 
is nearer the vertex. The zone containing the vertex we call the “cap,” 
the following the “first zone,” and so on. The zones have the following 


properties: 
1. Each interior point of the kth zone is the vertex of exactly two 
distinct loops of each of the types 1, 2,---, k, but is the vertex of no 


loop of type greater than k. 

2. Each boundary point in the kth zone is the vertex of two loops 
of each of the types 1, 2, -- - , R—1 and of one loop of type &. It is 
the vertex of no loop of type greater than k. 

3. No point of the cap is the vertex of any loop. 

Let the paraboloid be that obtained by revolving r? =x about the x- 
axis. As coordinates of a point P on the surface (the vertex excepted) 
we take 7 to be the length of the vector R from P perpendicular to 
the axis and directed toward P, and 6@ to be the angle which the plane 
determined by P and the axis makes with an arbitrary plane contain- 


1 Presented to the Society, February 24, 1940. 
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ing the axis. The differential equation defining the geodesics (cf. Dar- 
boux, Legons sur la Théorie Générale des Surfaces - - - , vol. 3) is 


cf 4? +171 


dr 


where c is an arbitrary constant. Corresponding to c=0 are the merid- 
‘ans 6=const. Since their nature is obvious we may except them from 
consideration, and assume c #0. The solution of (1) corresponding to 
a given value of c¥0 and passing through (ro, 4) is 
4r? + + 1)1/? + 2(r? — c?) 1/2 
—— —clo 
6 (4r? + 1)1/2 — 2(72 — 
+ clo 


6 = 0 + arc tan e| 
(2) 


r? — ¢? 


— arc tan | 
c? 


= 00 + f(r; ro, c), say. 
As r—|c| the right member of (2) approaches 
6 = 0 + — sign c — arc tance 
2 r— ¢ 
(479+ 1)" + 


(3) 


+ c log 


It can be shown without difficulty that (| c| , 9) is a point of the corre- 
sponding geodesic.” It is apparent from (2) and (3) that if c<0 @ in- 
creases monotonically from § at r=|c| to » at r=, while if c>0 
decreases monotonically from § at r=|c| to — at 
Consider the two solutions 

= + f(r; ro, ©), 

9°) {00 + 2f(| )} + f(r; ro, — ©) 
2f(| ¢| 5 )} — 
It is clear that acn(| c| ) =9°)(| c| ) =8. Moreover, at the point (| c| , 0) 


the two solutions have a common tangent, and thus belong to the 
same geodesic. Furthermore 


(4) (r) — 6 = f(r; 70, c) — f(| c| 70, = — [0 (r) — 5] 


It is readily verified that if r and @ are expressed in terms of the arc length s 
along the geodesic measured from (ro, 9), then r—|c| and 6—@ as s approaches a 
certain finite limit. 
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so that the points (r, 0(r)) and (r, 0(r)) are mirror images of each 
other in the plane determined by the axis and the point (| c| , 6). This 
is ‘he point of symmetry of the geodesic, and equations (4) give a pair 
of conjugate rays as defined above. Bearing in mind that @(r) and 
6°)(r) are monotonic in opposite senses, and that the absolute value 
of each is large with r, we see that each geodesic has infinitely many 
double points, but no other multiple points. 

Assume @)=0. Then @ is the angle through which the radius vector 
R turns as (r, 6) moves along a geodesic ray from its initial point 
(ro, 0) to its point of symmetry (| cl, 6). We shall study the variation 
of @ as the parameter c, which controls the direction of the geodesic 
ray at its initial point (ro, 0), varies over non-negative values. 

We find from (3) for c>0 that 


db + 1712 


dc —¢ 


As i increases from 2 to ~ the logarithm in (5) decreases monotoni- 
cally from © to 0. Thus d6/dc =0 has a single root A» >2. In terms of c, 
from the second of equations (5), d@/dc vanishes when and only when 
c=c*=((3—4)r,—1)". This root is real if, as we now assume, 
If c>c*, d0/dc<0; if c<c*, d8/dc>0. Since c) 
is continuous in for O0OSc<ro, and since 6(ro, 0)=27/2 and 
lim..,, c) =0, O(ro, c) attains its greatest value, 0*(ro), at c=c*. 
From (3) and (5) 


6*(ro) = 2/2 — arc tan 
+ — 4)ro — 1) 


This function is continuous for ro>=1/(A4j;—4)"/? and its derivative, 
is positive for ro = 1/(A§ —4) 2. Thus 6*(r9), which 
gives @ for the geodesic ray of greatest @ through (ro, 0), is con- 
tinuous and strictly increasing. Moreover 6*(1/(Aj—4)!/2) =2/2 
and lim,,.. 0*(ro)= ©. There is, then, a sequence of values of 
<1 <1@ < such that 6*(r)=ke (k=1, 2,---). The 
portion of the paraboloid defined by 1 <ro<r*” we shall call the 
“kth zone,” and the parallel ro=r its boundary. Let ro be an in- 
terior point of the kth zone; then ka <6*(ro) <(k+1)7. As c increases 
from 0 to c*, 8(ro, c) increases from 1/2 to 6*(ro), and as c then in- 
creases from c* to ro 0(ro, c) decreases from 6*(r9) to 0. Hence there are 
two sequences of values of c, {ci”, - -- and {c?,---, 
satisfying <c* << - . and the 
property that ott) = B(ro, =mrz, , k). When 1 is 


(6) 
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on the boundary of the kth zone, ci? =c* =c®’; but, for m<k, c® and 
c® are distinct. 

Interpreted geometrically this means that through a point (ro, 0) 
in the kth zone there are, for each value of m=1,---, k, two geo- 
desic rays for which 6= mr. The conjugate ray of each, being a reflec- 
tion of the original ray, clearly passes through (ro, 2m7), which is 
identical with (ro, 0). This point is therefore the vertex of two loops 
of each of the types 1, - - - , &. If (ro, 0) is on the boundary of the &th 
zone this statement must be modified to the effect that the loops of 
type & are coincident, those of lower types remaining distinct. 

It remains to examine the cap, that is, that portion of the 
surface for which If 0*(r0)<m@ and 
no loop with vertex (ro, 0) is possible. If r¢o<1/(—4)*/2, then 
A= —c?) and As c in- 
creases from 0 to 7o, 8(ro, c) decreases from 2/2 to 0. Thus no loop 
having a vertex in this portion is possible. We have now established 
all the results stated in the introduction. 

The numbers 7% can be calculated from (6). For, if we set 
6*(ro)=nma and replace by (6) becomes 
p—arc tan p=(2k—1)m/2. Since the left member increases mono- 
tonically from 0 to © as pw increases from 0 to o, there is 
exactly one root pw, for each value of r=1, 2, 3,---, and 
rh” = [(u2+1)/Aj—4)]2. For large values of 1, uw, is of the order 
of nz, and 7 is of the order of [(n?x?+1)/(3—4) ]"/2. We have 
approximately \»)=2.40 and 1/(4j—4)/2=3/4. Thus 37/4 furnishes 


an estimate of r{ for large values of n. 
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PROJECTION OF THE SPACE (m) ON 
ITS SUBSPACE (c) 


ANDREW SOBCZYK 


In a paper in the Duke Journal, A. E. Taylor! remarks that it is 
an open question whether or not there exists a projection of the space 
(m), of bounded sequences, on its subspace (co), the space of sequences 
convergent to 0. In this note we make a few remarks which supple- 
ment those of Taylor on this question, and we point out that a nega- 
tive answer follows from a recent result of R. S. Phillips,? so that the 
question is now settled. 

Taylor shows that if a projection of the space (c), of convergent 
sequences, on the space (¢9) exists, it must be of norm greater than or 
equal to 2. This implies the same result for (m) on (co), since any pro- 
jection of (m) on (co) would be in particular a projection of (c) on (co). 

The space (c) is essentially of dimension only one greater than that 
of its subspace (co). This follows since (c) is obviously the set of all 
elements of the form x =x-+2X,, where X,=(1, 1,---), x%€(q), 
and ¢ is a number. If x= {x;} is any element of (c), the linear func- 
tional a(x) =t=lim,.. X, is of norm 1, and vanishes on the subspace 
(co). Now it is a remark of Bohnenblust* that for any subspace of a 
normed linear space L defined by the vanishing of a fixed linear func- 
tional on L, there exist projections of norm less than or equal to 
2+, for arbitrary «>0. Consequently there are projections of (c) on 
(co) of norm less than or equal to 2+e. 

There are projections of (c) on (c) which are of norm exactly 2, 
as may be seen as follows. If x = (x-+#X,)€(c), the general form of 
a projection of (c) on (¢) is 


Px=x + t{b;} = x + 4(Xi+ {b;}) 


where {b; } is any sequence of constants such that lim;.,. 6; = —1.4 To 
calculate the norm of P, we have || 
and ||x|| =|/x+2X,|| =sup, since Therefore 


|P|< <1 +sup: |b;|, and because of Taylor’s result this has the value 


1 The extension of linear functionals, Duke Mathematical Journal, vol. 5 (1939)? 
pp. 538-547; p. 547. 

2 On linear transformations, Transactions of this Society, vol. 48 (1940), pp. 516- 
541; pp. 539-540. 

3 Convex regions and projections in Minkowski spaces, Annals of Mathematics, 
(2), vol. 39 (1938), pp. 301-308; p. 308. 

* See Taylor, op. cit. 
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2 for all projections such that |b;| <1 for all 7. In particular the pro- 
jection Px=x—a(x)-X,=x, obtained by taking b;=—1 for all i, 
is of norm 2. 

Corresponding to any sequence {e:}, where, for each 7, e; is either 
+1 or —1, let us define a space c;... as the set of all elements of the 
form x+t{e;}, where x€(c). Any space cz... is equivalent® to (c) 
under the automorphism {x;}<>{e,x;} of (m). Also (co) is a subspace 
of cz..., since (co) is invariant under the automorphism. There are 
obviously projections of norm 2 of any cz... on (c), similar to the 
projections of (c) on (¢). 

If L and L’ are any two linear subspaces of (m), we define the sum 
L+L’ as the linear subspace of (m) which consists of all elements of 
the form x+<x’, where xCL, x’€L’. As usual we denote the closure 
in (m) of any subspace by a horizontal line over the symbol represent- 
ing the subspace. 

For any set of 1 linearly independent sequences X ;= {e:;} ,@j3=4+1 
or —1, j7=1, 2,---, m, let 1, denote the m dimensional space of all 
linear combinations )> t;X;. Suppose the X;’s are such that /, and 
(co) do not intersect, except in the origin. Then we may define a sub- 
space (c,) of (m) as the space (co)+/,. Similarly, for any countable 
sequence of X,’s, any finite number of which are linearly independ- 
ent, we denote the space of all finite linear combinations )_ t;X; by 
l,, and we define 1,,=1; («© =No). If J; and (co) intersect only in the 
origin, we define 


= (co) + = (co) + ly. 


(If 1, and (co) intersect only in the origin, each x€(c@)+/., has a 
unique decomposition x +x, x €(c), x El,; otherwise not. 
The space (co) +/., is not necessarily closed.) 

Let X:= {ea}, X2={e2}--- be the rows of an infinite matrix 
{{ e:;}}; let {{ e:;}}, denote the matrix of the first » rows of {{ e:;}}. Then 
any space (cn) or (c») is determined by a matrix {{e,;}}, or {{e:;}}. 


THEOREM 1. In any space (cn) [or (c.)| such that for every column C; 
which appears in the matrix {{e:;}}, [for each n] one of +Ci, —C; is 
identical with C; for an infinite number of values of 1, as i, there 
exists a projection of (cn) or (c.) on (Co), which is of norm 2. 


5 Two normed linear spaces are isomorphic if there exists a 1-1 transformation T 
between them which is linear (that is, distributive and continuous) in both directions; 
they are equivalent if in addition | T| =| T| =1. 
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Proor. If x=x™ +) 5, x €(c), define Px =x. Then 


7 

By the hypothesis on the matrix {f{e;;}},, and since if x= {x}, 
||x|| =sup, | [max; | >> t;e:;|. This implies || Px| 
< 2| x | for any x of (c,) or of the dense linear set in (c,,). In the latter 
case P has a unique continuous extension to (c,,), with range (co), and 
the norm is preserved. This verifies the theorem. 


|x- 


+ 


THEOREM 2. For every space (Cn), 1SnZ ~, there ts a projection of 
(Cn) on (Co) which ts of norm 2. 


Proor. Let (c,) be determined by X1= {ea}, X2={ee},---, as 
above. Consider first the case of any finite n. Since only a finite num- 
ber of different n-element columns are possible, there can be only a 
finite number of columns C; in the matrix {{e;;}}, which are not 
infinitely repeated. Since X; and X_2 are linearly independent, and /, 
intersects (co) only in the origin, the matrix {{e;;}}. evidently satis- 
fies the hypothesis of Theorem 1. Consider (c3) = (co) +/3. By altering 
at most a finite number of coordinates of X3, the matrix {{ e:;}}3 can 
be made to satisfy the hypothesis. Let the altered sequence X3 be X$ ; 
then Xj =X3;+y™, where y €(c). Therefore the space (c3)’ deter- 
mined by X,, X2, Xi coincides with (c3). Proceeding in similar fash- 
ion, we see that any space (c,) or (co) +l; may be represented by a 
matrix {{ e;;}}, or {{ e:;}} which does satisfy the hypothesis of Theorem 
1. Our theorem is thus proved. 


THEOREM 3. There exists a matrix E={ {ei;}}, such that: (1) the 
corresponding subspace l,, of (m) intersects (co) only in the origin, and 
is equivalent to the space l,,.. of absolutely convergent series; (2) (co) +l. 
is closed in (m), and l1,, and (co) are complementary subspaces in 
(Co) = (co) +12; (3) the projection of (c.) on (co) according to the comple- 
mentary subspace l., of (co) in (c..) ts of norm 2. 


Proor. Let + and — represent +1 and —1. The matrix E is 


- + + 


(The kth row of E consists of alternately 2‘—! plus signs and 2*-! 
minus signs, for all &.) In this matrix, obviously all possible finite 


++ -- + 4+ - 

++ 4+ 4+ -- - 
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combinations of +-’s and —’s occur in the columns. Thus we have for 
any The space /,, determined by E is therefore 
equivalent to /,,... Suppose that (co) and (/,,) have a common element 
x. Then there is a sequence {x,},x,=).%c;,.Xj, which converges to x. 
Since x€(c), by the construction of E we must have )-7|c;,.| 0 
as n— ©. But this implies x,—0, x =0. This verifies statement (1) of 
Theorem 3; statement (3) follows by Theorem 1. Statement (2) then 
follows by a lemma of F. J. Murray, that (for Banach spaces) the 
existence of projections and of complementary manifolds are equiva- 
lent.® 


THEOREM 4. There exists a subspace of (m) equivalent to the space (C) 
of continuous functions, such that: (1) the subspace (C) of (m) and (c) 
intersect only in the origin; (2) (C)+-(co) ts closed in (m); (3) the projec- 
tion of (C)+(co) on (co) according to the complementary subspace (C) 
of (co) im (C)+(co) is of norm 2. 


Proor. Let {r;} be the set of all rational numbers of the interval 
(0, 1), in their usual enumeration. Then any x =x(¢) of (C) is uniquely 
determined by the set of values {x(r:) } , while (m) may be regarded 
as the set of all bounded real-valued functions x= {x;} = {x(r,)} on 
the set { ri} . Clearly this correspondence of each x(¢)€(C) to its set 
of values {x(r,)} in (m) is an equivalence. Furthermore it is obvious 
that if lim;... x(r;)=0, x(#)=0. Therefore (C) according to this im- 
bedding intersects (co) only in 0. 

{x(r:)} corresponds to x(t)€(C), and x = {x (r:)} belongs 
to the subspace (c) of (m), then sup; | +x(r,)| > max | x(t) | 
=sup; | x(r;)| since x(r;)-0, and is continuous. Thus 
||x+-|| >||x|], and the operation defined by Q(x+x) =x is a pro- 
jection of norm 1 of (C)+(co) on (C). The operation P = J—Q is a pro- 
jection of (C)+(c) on (c), and |P| =|Z—Q|<]|Z|+]Q| =2. The 
proof of statement (2) of Theorem 4 is the same as the first part of 
the proof of Theorem 6, below. 

It is a well known theorm of Banach that every separable Banach 
space is equivalent to a subspace of (C).7 Therefore by Theorem 4, 
if (B) is any separable Banach space, there is a subspace of (m) equiv- 
alent to (B), such that the projection of (co) +(B) on (q@) through (B) 
is of norm 2. 


6 See F. J. Murray, Transactions of this Society, vol. 41 (1937), pp. 138-139. 

In a normed linear space L, two linear subspaces L; and L2 are called comple- 
mentary manifolds or subspaces if (1) they are closed in L, and intersect only in 0; 
(2) every xEL has a decomposition x=x:+x2, where 1€ Li, x2€ D>. It follows by 
assumption that this decomposition is unique. 

7S. Banach, Opérations Linéaires, p. 185, Theorem 9. 
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The result of Theorem 2 may be generalized, by removing the re- 
striction that the e;;s may have only the values +1 or —1. Let 
X= {ea} be any element of (m) — (co). Let S:= [tX,] be the space of 
all elements ¢X,, and let B= (co) +-Si. If P is a projection of B, on (co), 
define a sequence of functionals { ys(x) } by {yi(x)}, 
where x=x-+1X1, x €(c). The functionals y,;(x) are obviously 
linear, and y.(x)=0 for all ¢ and x €(c). Let y:(X1) =aa. Then 
tPX,=tX,+t{aa}€(c), and we see that the general form of a pro- 
jection of By on (c@) is Px=x+t{aaten}, where {aa} is any se- 
quence of constants such that lim;... (@a+ea) =0. 

If X:= {ea} as above, let lim sup; |ea| =a. Define X/ = {ei } by 
eu if | Sa, ew =a sign éa if | 2a. Then 
= {ea—eud } €(c), and the space is the same as B,. 
Thus without loss of generality we may always assume | ex1| Sa 
=lim sup; |ea| for all k. This assumption is made in the following 
paragraph. 

There are projections of any space B, on (co) which are of norm 2. 
For if Px=x+t{aa}, then ||Px|| |aal. Since ||| 
=||x +2{ea}|] =sup, |x +ten| >sup; |tea|, we see that | P| is 2 
for all P such that |aa| <|ea| for all i. (Note that in no case can | P| 
be less than 2.) 

For any finite or countable set of linearly independent sequences 
X;= {e:3} , (e;;'s not restricted to +1), let S, or S; respectively denote 
the subspace of all linear (finite) combinations )) ;X ;. As before, sup- 
pose that S, or S; does not intersect (co), except in the origin. Define 
B,=(c)+5S,, By = (co) + S;, B. = B;. The general form of a projection 
of on (co) is Px=x+{ where x=xO4)> x 
and the a;;’s are any constants such that lim;.,, (@:;+¢;;) =0 for each j. 
This follows by additivity of P, since P must be in particular a pro- 
jection of each of the subspaces (co) + [#X,] of B, on (co). 


THEOREM 5. If Wis any separable Banach subspace of (m) such that 
WD (co), there is a projection of W on (co) which is of norm 2. 


Proor. It follows easily from the hypothesis that W is separable, 
that there exists a sequence {X;} such that either W is a space B, 
(n finite), or 


W = B,, = (co) + S;. 


(Conversely, any space B, or B,, is separable.) 

Consider any space B, or B, determined by a matrix E= { {e,;}} 
which is such that the e;;’s have only a finite number N of different 
values. By an argument similar to the proofs of Theorems 1 and 2, 
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we see that there is a projection of B, or B; on (co) which is of norm 2. 
This projection is specifically the one given by the general form above 
when a;;= —e;; for all 7, 7 (after the e;;’s have been suitably altered). 

For any space B, or B;, without loss of generality we may assume 
that || X,| =1 for all For each integer N, divide the interval OS u<1 
into N subintervals OS N-!, N-'<us2N-1,---, (R-—1)N-!<u 
<kN-,---, (N—1)N-1<uS1. If |e;;| lies in the kth of these in- 
tervals, let ki; =kN sign e;;. In this way we associate with the matrix 
{{eis}}, of B, a matrix {{k,;}},, in which the k;;'s have only the 
values +kN-!, k=1,---, N. By a finite number of alterations, the 
matrix {{%,;}}, may be changed into a matrix {{k}}, in which 
every column is infinitely repeated; this requires addition of integral 
multiples of + N-! to a finite number of the &;;’s. Alter the matrix 
{{ e:;}}, by adding the same multiplies of + N-1 to the corresponding 
e:’s; we thus obtain a new matrix {{e}}, which represents the 
same space B,, and which has {{ k”}}, for its associated matrix. Let 
of {fe }} be denoted by {X$”}; those of {{ }} by 

If for each N let x =x 45°" where x) €(c). 
Let x) =P™)x, Let N take on only the succession of values 2’, 
v=1, 2,3, ---. Then evidently we may make the required alterations 
in the matrices {{k™}}, (and {{e}},) in such a way that 
limy ,y’|| =0 for each j; that is, is a Cauchy se- 
quence for each j. If and 
then 4° — XM), and 


Therefore for each x€B,, {x‘™)} is a Cauchy sequence of elements 
of 
Let « =limy x); and define Px =x. Then 
N wy), || 
| le — + Duly; — X; )| 


| | | N 
< + | | + | x”) 


We also have 


8 The #;’'s do not change with N, because in the expression of any x, for each 
succeeding WN the finite alterations can be compensated by changing x? __and for 
each system fart: the expression of x is unique (by the hypothesis that S, inter- 
sects (¢) only in the origin). 
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=f + 


()| 


+ | 


(0,N) 
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IV 


(Ny) 
—X; ) 


(N || 
i i 


by the triangle property of the norm, and since each column of 
{{k}}, is infinitely repeated. Therefore ||P x|| =|]x‘-™|| 
Taking the limit as © (vy) in this in- 
equality, we see that ||Px||<2||x|] (since || PO 
—||Pa||| 0, || >|] Pal). 

The operation P, defined for each xB, or By, is clearly a projec- 
tion of norm 2 of B, or By; on (c), and our theorem is proved. (In 
case of B;, as in Theorem 1 P has a unique extension to B,=B,,= W.) 

The foregoing discussion has produced instances of the existence of 
projections defined on subspaces of (m) to (co), culminating in Theo- 
rem 5. Using the result of Phillips mentioned above, we now show 
that there is no projection defined on (m) to (c).° This result is per- 
haps rather surprising, since in all of our previous instances, not only 
did projections exist, but also there were projections of norm ex- 
actly 2. (One might expect to be able to find easily a succession of 
larger and larger subspaces containing (co), for which the greatest 
lower bounds of the norms of projections would increase unbound- 
edly.) Theorem 5 shows that the nonexistence of a projection on (m) 
to (c) is due in an essential way to the inseparability of (m). 

Phillips’ result is that there is no projection on (m) to (c). It follows 
easily from this that there is no projection on (m) to (cp), in view of 
the following simple theorem. 


IV 


THEOREM 6. Suppose that l,, l, are any pair of complementary sub- 
spaces in a Banach space L, and that Il? , lz’ are any pair of complemen- 
tary subspaces in lz. Let Li=(1,+1/). Then Li is closed in L, and Ly 
and L2=Iz' are complementary subspaces in L. 


Proor. Let P be the projection of L on /, according to J. Suppose 
that {x,} is any sequence of Li, such that x,—x in L, and let 
, Where xmCh, Ely. Then Px, =x Eh, and 
—Xu—(I—P)x =x? by the continuity of P 
and of Therefore =x, so that xC Ly, and 
L, is closed. Moreover, /; and l7 are complementary subspaces in 1. 


® “Projection of (m) on (co)” of course means the same as “projection on (m) to (¢o).” 
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By hypothesis, for any xEL, x +x2’, where 11.€h, 
, x2’ Elf’, and this decomposition is unique. Therefore the de- 
composition x =(x1+x7 according to L; and also is unique. 
This verifies Theorem 6. 

If there were a projection of (m) on (co), there would be a comple- 
mentary subspace /, to in (m). The element X,=(1, 1,--- ) 
€(m) would have a decomposition X,;=x-+X, x«™€(e), XEh. 
Consider the one-dimensional subspace consisting of all elements of 
the form X. By the Hahn-Banach theorem, there exists a projection 
of norm 1 on any one-dimensional subspace. Thus if // = {tX}, there 
exists a complementary subspace in The space Li=(1,+/2 ) of 
Theorem 6 is (c), and by Theorem 6 and the lemma of Murray al- 
ready mentioned, there would exist a projection of (m) on (c),-con- 
tradicting the result of Phillips. Thus there can be no projection of 
(m) on 

By an argument identical with that used by Phillips to prove the 
nonexistence of a projection of (m) on (c), it may also be shown di- 
rectly that there is no projection of (m) on (¢&). Theorem 6 will be 
required, however, for another remark to be made below. (It should 
perhaps be mentioned that (c) is isomorphic to (c).1°) 

By Theorem 5 and since there is no projection of (m) on (co), there 
is no projection of (m) on any separable subspace WD (co). (For if Q 
were a projection of (m) on W, P a projection of W on (9), then PQ 
would be a projection of (m) on (c).) The following question now 
arises: Does there exist a closed linear subspace L of (m), LD(c), 
such that there is no projection of (m) on L, and no projection of L on 
(co)? Of course by Theorem 5, such a space L cannot be separable. 

An extension theorem found in the paper by Phillips which has 
been cited!! is as follows. Let X be any linear subspace of a Banach 
space Z, and let U be any linear transformation on X to (m). If 
y= {ys} = Ux, the functionals y,;(x) are obviously linear, and by the 
Hahn-Banach theorem they may be extended to Z with preservation 
of the norm. Let 2;(z) denote the extension of y;(x). Then Uz = {z,(z) } 
is a linear transformation defined on Z with range (m), which coin- 
cides with U on X, and | U;| = | U| . As a consequence of this, if X is 
isomorphic to any subspace (B) of (m), and if there exists any Banach 
space WDX such that there is no projection of W on X, then there 
is no projection of (m) on (B). For if U is the isomorphism, U; its 
extension on W, and if Q were the projection of (m) on (B), then 
U~'QU, would be a projection of W on X. 


10 Banach, op. cit., p. 181. 
11 P, 538, Theorem 7.1, Corollary 7.2. 
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It follows immediately by the previous paragraph that not only is 
there no projection of (m) on (co) in the usual orientation of (co) with 
respect to (m), but also that there is no projection of (m) on (co) in 
any imbedding whatever of (co) in (m). (Let W be (m), let X be (c) 
in the usual imbedding, let (B) be (co) in an arbitrary imbedding in 
(m), and take U to be the identity on (co) to (¢o).) Also, since as has 
been shown by Phillips!* there exists no projection of (C,) (the space 
of functions having only discontinuities of the first kind) on (C), it 
follows that there is no projection of (m) on (C) in the imbedding of 
Theorem 4, or in any other imbedding. 

Another interesting consequence of Phillips’ extension theorem is 
that for every Banach space ZD(m), there is a projection of norm 1 
on Z to (m). (Extend to Z the identity transformation of (m)CZ 
into (m).) Also, consider the finite dimensional spaces l,, ,, of sequences 
x=(x1,°- +, %n), with norm ||x||=max; |x;|. In a similar way, it fol- 
lows that there is a projection of norm 1 on any Banach space 
Z Dla.» to 1.,n. There is probably some connection between this and 
the result of Theorem 5, since (co) D/.,, for every n. A question which 
arises here is whether the most general separable Banach space having 
(co) as a subspace is contained in (m) in the same relationship to (co) 
as in Theorem 5. Anyway, if B is any separable Banach space having 
a subspace equivalent to (co), there is a projection of norm 2 of B 
on (9); for let U be the identity on (co) in B to (co) in (m), extend U 
to be U; on B to (m) by Phillips’ theorem, and let the proper range 
of U; be W’. By continuity of U;, W’ is separable. Let P’ be a projec- 
tion of norm 2 on W’ to (¢9), as given by Theorem 5. Then U-!P’U, 
is the required projection of norm 2 on B to (¢9). 

Any separable Banach space Y is equivalent to a subspace (B) of 
(m). The question of whether there is an infinite dimensional, sepa- 
rable Banach subspace (B) in (m), such that there is a projection on 
(m) to (B), is equivalent to the question of whether there is a separa- 
ble Banach space Y, such that for every Banach space WD Y, there 
is a projection on W to Y. A further question is whether such a sub- 
space is necessarily reflexive. ((co) is not reflexive.) 

Let Y be any separable Banach space; imbed Y in (C) and (C) 
in (m). If there is no projection on (C) to Y, then there is no projection 
on (m) to Y in any imbedding. If there is a projection on (C) to Y, 
then using Theorem 6 and the facts that any two complementary sub- 
spaces of the same closed linear subspace are isomorphic, and that 


2 Loc. cit.. p. 539. 
18 A direct imbedding is given by Phillips, loc. cit., p. 524. 


1941] STIELTJES INTEGRALS 947 


there is no projection on (m) to (C), it may be shown that at least 
either there is no projection on (m) to Y, or else there is no projection 
on (m) to the complementary subspace of Y in (C). (An illustration 
of the case where there is no projection on (m) to the complementary 
subspace in (C) is provided by the case of a finite dimensional 
YC(C).) 

In a paper in preparation on the extension of linear transforma- 
tions, the writer intends to discuss the questions indicated above, and 
related questions. 


OREGON STATE COLLEGE 


SEQUENCES OF STIELTJES INTEGRALS! 
H. M. SCHWARTZ 


Statement of results. Sequences of Riemann-Stieltjes integrals? 
have as yet been little studied, only the following fundamental results 
being known. 


THeEorEM A (Helly [2]). Let ga(x) (n=1, 2, ---) be an infinite se- 
quence of real functions defined in the finite closed interval I=(a, b) 
which satisfy the following two conditions: 


(1) Total variation of g, in I = Vi(gn) S M, M a fixed constant, 
(2) gonl, ©; 


then for any function f(x) continuous in I, we have* 


(3) f fae. 


THEOREM B (Shohat [3]). Let {gn} be a sequence of functions mono- 
tonic and uniformly bounded in I and such that 


(4) gr —gonE, Ea set dense on I and including the end points a, 6 of I, 


where g is a monotonic function (all the functions g,, g monotonic in the 
same sense); then we have (3) for any function f(x) for which 


1 Presented to the Society, January 1, 1941. 

2 A discussion of such integrals with references is to be found in [1]. (Numbers 
in brackets refer to the bibliography.) 

3 When the limits of integration are omitted, it is to be understood that they are 
the end points a, 6 of J. 


| 
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(5) exist, *22=1,2,---, 


(6) f fdg exists. 


Theorem A was rediscovered by Bray [4]* and a condition weaker 
than (4), which is both necessary and sufficient, was found by Hilde- 
brandt [5].* 

The present paper is devoted to a preliminary study of conditions 
governing convergence of the sequences in question when neither con- 
tinuity of f nor monotonicity of g, is assumed. The goal, not attained 
here, is to find necessary and sufficient conditions to be imposed on 
the sequence {g,}, where the functions g, belong to the class V of 
functions of bounded variation in J, in order to insure the validity 
of (3) for every bounded f for which (5), or (5) and (6) hold true. Two 
necessary conditions can be given at once. Condition (1) is necessary 
since it is already necessary if we require that f belong only to the 
class of functions continuous in I [5]. It is interesting to note, how- 
ever, that condition (1) is not necessary if we require that f belong 
only to V, as is seen by the following theorem: 


THEOREM 1. If we have (2), then in order that we have (3) for every 
function f of V for which (5) and (6) are true, it is sufficient that the 
sequence {ga} be uniformly bounded. 


The other necessary condition is obtained by taking for f functions 
which are constant everywhere in I except for a single point. It is 
easily found that the set C of points of convergence of g, to g must 
include all the common continuity points of g,, g as well as the end 
points a, b of I. Without going into the question of necessity® we 
shall assume that C contains all the continuity points of g (whether 
they are also continuity points of all g, or not), that is, that 


gn —gonC, gin V, 
(7) 


C consisting of the continuity points of g and of a and b. 


4 In Bray’s formulation of the theorem condition (2) is replaced by (4); but it is 
then necessary to specify that g is a function of bounded variation; and by the lemma 
given below, it is seen that the two formulations are compietely equivalent. 

5 The following additional reference was brought to the author’s attention by the 
referee: Hildebrandt, Stieltjes integration of the Riemann type, American Mathematical 
Monthly, vol. 45 (1938), pp. 265-278, Theorems 2.41; 5.41. 

6 It is to be noted that in Theorem B as well as in its generalization, Theorem 3, 
conditions (4) and (7) are equivalent, as can be readily proved. 
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The question arises whether condition (7) represents in the problem 
under consideration an essential weakening of condition (2). That this 
is not the case is shown by the following lemma. 


Lema. Under the conditions (1), (5), and (6), if (3) ts implied by (2), 
it will also be implied by (7). If we are considering only a certain f satis- 
fying our conditions, then C need consist of only those continuity points 
of g which are also discontinuity points of f (in addition to being dense 
on I and containing the points a and b). 


It is thus no loss of generality to assume condition (2) rather than 
(7) in the following theorems. When this is the only condition im- 
posed on the sequence {g,}, only the following result (besides that 
given in Theorem 1) has so far been obtained. 


THEOREM 2. If f has at most singularities of the first kind, then (1), 
(2), (5), and (6) imply (3). 


A direct generalization of Theorem B is given in the following theo- 
rem. 


THEOREM 3. If in addition to (2) we assume that 
(8) Vr(gn) — Sn, in V, 
then (5) and (6) imply (3). (Condition (1) is of course implied by (8).) 
The next theorem gives a result of wider scope. 
THEOREM 4. To insure the validity of (3) it is sufficient to assume in 
addition to (1), (2), and (5) that? 
(9) for every limit function v(x) of the sequence 1 Valgn) } frac exists. 


In some applications the interval of integration is infinite and it is 
therefore of interest to consider that case. This is treated here only 
briefly, the following results being immediate.® 


THEOREM 5. For I=(a, b)=(a, ~), tf we assume (2), (5), and 


(10) f fdgn ~f fdg for every finite c > a; f fdg exists, 


7 The existence of at least one limit function of the sequence given in (9) is insured 
by Helly’s theorem of choice [2]. Note also that (6) is implied by (9), as can be 
seen, for example, by using the integrability condition on p. 542 [1] along with 
Theorem 2 in [7]. 

8 In order to simplify the formulas, we take only one of the limits as infinite. 
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then a necessary and sufficient condition for the validity of (6) and (3) 


is that we have either one of the following two conditions (e being an arbi- 
trarily assigned positive number) : 


(11a) | J 


(11) | 


IV 


Cy 


IV 


n= n.,c = Cle, n). 


Note 1. Condition (10) of the theorem can of course be replaced 
by the appropriate conditions in the preceding theorems which insure 
the validity of (10); thus if f is continuous in J, we can replace (10) 
by the requirement: 


(12) Vilgn) M(o). 


Norte 2. If fis bounded in J, then (11a) can be replaced by the rela- 
tion V2(g,)<e, c2c., n= N(e, c). 

Note 3. If f is bounded in J and g is of bounded variation for 
sufficiently large x, then we need assume only the validity of a relation 
like (4), and (11) can be replaced by the following corresponding rela- 


tions: 
f fdgn 
ay 


| 


where {ax} is some sequence in E converging to ©. 
Note 4. If we have (12) and if 


lim B {| }-M@ = 0, 


k 


IV 


IV 


k.,n = N(e, k), 


IV 
IV 


<e, n2n,,k= K(e,n), 


then we certainly have (11a). (The above relation will hold for in- 
stance when limz..,. f(x) =0 and M(c) is independent of c.) 

By the use of Theorem 5 we can give a direct proof of the existence 
of a solution of the moments-problem of Stieltjes-Hamburger.® This 
proof as well as other applications of the above theorems will be given 
elsewhere. 


Proof of the lemma. The proof will be based on the following aux- 
iliary lemma. 


® The proof given in the literature is indirect, involving complex function theory. 
A discussion of this problem and references are to be found in [6]. 


— 
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LEMMA. Let g and h be two functions of V which coincide in value on a 
set D, and let f be a function bounded in I. Then the existence of the 
integral of f with respect to one of these functions, say g, will imply its 
existence with res pect to the other function h, and the equality of the values 
of the two integrals, provided D is dense in I and includes the end points 
of I as well as all those discontinuity points of f at which h has an ex- 
ternal saltus. 


Proor. That /fdg={fdh when both integrals exist follows directly 
by the definition of Stieltjes integrals and the fact that D is dense in I 
and includes the end points of I. To prove that ffdh exists it is clearly 
sufficient to prove that /fdk exists, where k=g—h; and for that it is 
only necessary to show that the variation of k over the set of discon- 
tinuities of f is zero [1]. This can be readily shown when we note that 
by our assumption k(x —0) =k(x+0) =0, and that therefore, if we de- 
note the discontinuity points of k by u; (4=1, 2, - - - ), the total varia- 
tion of k over the set obtained by excluding from J arbitrarily small 
intervals about the points 1, ---, um, can be made as small as we 
please provided m is taken sufficiently great; and that further by our 
assumptions, f is continuous at each 1;. 

To prove the lemma, we note that by (1) the sequence { ffdg,} 
={J,} is bounded and it therefore contains at least one convergent 
subsequence. Let {J,-} be any such convergent subsequence and de- 
note its limit by J; let {g,-} be a convergent subsequence of the 
function-sequence {g,-} (existing by Helly’s theorem of choice [2]), 
and denote its limit function by g*. Now it can easily be proved that 
by the condition given in the lemma, no limit function of {g,} can 
have an external saltus at a discontinuity point of f. We can apply 
therefore the above auxiliary lemma and conclude that {fdg* exists 
and equals /fdg. Hence by our assumption, it follows that ffdg,-.—fdg, 
and consequently also J=ffdg. But {J,-} was an arbitrary conver- 
gent subsequence of {J,}; hence {J,} has the unique limit J, that is, 
we have (3). 


Proof of Theorem 1. It is clearly sufficient to consider f monotonic. 
By the formula of integration by parts for Stieltjes integrals [1], it is 
seen that the theorem will follow if we show that 


(*) — fear 


To this end, we employ Lebesgue’s theorem on the convergence of se- 
quences of Lebesgue integrals with boundedly convergent integrands. 
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Putting A =f(a), B=f(6), Ga(y) =gn(f-"(y)), Gy) =g(-"(y)), we have 


[1]: 
fouar = foar= “cones: 


and as by our assumptions, all the conditions of Lebesgue’s theorem 
are satisfied, it follows that [G,(y)dy—fG(y)dy, and that we have (*). 


Proof of Theorem 2. This theorem follows from the following three 
lemmas (Lemma 2 is more general than the present use of it requires). 


LemMA 1. Theorem 2 1s true tf f is in V. 


Proor. Let f=f.—fa be the decomposition of f into its continuous 
part f. and its discontinuous part fz [1]. By Theorem A, /f.dg,—/f.dg, 
and it remains to be shown that we also have the relation [fadg,.— Sf adg, 
or the relation corresponding to (*) above. Now [gndfa=)m¢m&n(tUm), 
where =f(um+0) —f(um—O), and as the series (tm) converges 
uniformly with respect to , being dominated by 


Bt 


which is finite by our assumptions, it follows that 


lim |gndfa = dmg(tm) = 


LemMaA 2. If the sequence of bounded functions fm converges uniformly 
in I to the bounded function f, and if we have (5), (6), and (3) with f 
replaced by fm (m=1, 2,-- +) then we have (5), (6), and (3), provided 
we assume (1) and the finiteness of V1(g). 

Proor. We show first that the existence of ff,dg and the uniform 
convergence of f,, to f in J implies the relation 


(*) fas exists, finds fias. 


g being assumed to belong to V. Let 
Tk 
Sme = Dy fm(ui) [g(xi) — g(xi-1)] 


be the kth Riemann sum of a sequence whose limit for Rk ~ is /fndg, 
and let s; represent the corresponding sum for f. As 


| sme — se] fm }-Vilo), 


zin 


| 
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we see that s,, approaches its limit s, uniformly with respect to k, 
and that therefore lim;... and limn.. {fmdg exist and are equal. Re- 
lation (*) now follows from the fact that the first limit is independent 
of the choice of the division points x; and the between-points u; in 
the Riemann sums. 

Now by (1), 


| fits. 


that is, /fmdg, converges to /fdg, uniformly with respect to m. Hence 


lim = lim lim finden = lim lim f 


BL 


= lim finds = fies, by (*). 


LemoMaA 3. If f is a function having at most singularities of the first 
kind in I, then there exists a sequence {f,} converging to f uniformly in I, 
where each f, 1s in V and 1s continuous at the continuity points of f. 


Proor. We have to show that we can assign to every given positive 
and arbitrarily small number e, a function f., which is in V, is con- 
tinuous at every continuity point of f, and approximates f within e. 
By assumption, f(x—0) and f(x+0) exist, and we can therefore as- 
sign to every x of the semi-open interval (a, b],1° a left-side interval 
(x’, x), and to every x of [a, b), a right-side interval (x, x’’), in each 
of which the oscillation of f is less than e/4. By the Heine-Borel cover- 
ing theorem, there exists a finite set of points x; in J such that the sum 
of the corresponding intervals (x/, x/’)=J; (including the intervals 
[a, a’’) and (0’, b]) covers J, and we may assume without loss of gen- 
erality that the overlapping parts (x/41, x/’) of consecutive intervals 
I; (necessarily nonvacuous, as the J; are open) lie inside (x;, x;+1). 
It can now be seen without difficulty that we can take as the required 
function f., a function defined as follows: It has the values f(x;), 
f(x;—0), and f(x;+0), respectively for x=x;, for x in (xj_,, xx), and 
for x in (x;, x/41); and it is linear in the remaining intervals. 

It is apparent how Theorem 2 can be derived by the use of the 
above three lemmas. We need only observe that the functions f,, of 
Lemma 3 (with f of Theorem 2) satisfy the relations (5) and (6) 
(with f=f,,) because they are in V, and have no common singularity 


10 The parenthesis will represent the open, and the bracket, the closed end of an 
interval. 
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points with the functions g,, g (by their definition, in view of (5) 
and (6)). It is in fact known that two functions of bounded variation 
which have no common discontinuity points, are integrable with re- 
spect to each other.'! 


Proof of Theorem 3. Conditions (2) and (8) are equivalent [7] to 
(2) and the relation 


(8’) Ve(gn) Valg)s n— o;xin J. 


Hence, denoting by gn=f2—Qn, Z=P—g the Jordan decompositions 
of the functions g,, g, we readily see that all the conditions of Theo- 
rem B apply to the two sequences {pn}, {gn} ([fdpn, [fdan exist be- 
cause ffdg, exist [1]), and that therefore: /fdp,—ffdp, [fdq.—fdq, 
and hence (3) is satisfied. 

The theorem can also be proved directly as follows:!? Let 


Snk = fuss) [gn(xi,4) 2n(Xi-1,4) | 


represent the kth Riemann sum of a sequence of such sums which 
converges to /fdg,.=J, when then: 


| Ju Sun] Oses 


the subscript 7 referring to the 7th subinterval (x;-1,x, x:,.). By (6), 
given a positive number h, there exists a number K = K(h) such that 
Or=)>.: Ose; f(x) Vi(g) <h, R= K(h). Now let e be a preassigned posi- 
tive number. Fixing first k’= K(e/2), we can then choose, in view of 
(8’), a number N= Ne, k’) so that for all n= N, we have 

e 


| — Veg) | <—— :U = Oser i = 0,1, 
4r,-U 


Then 
rk’ 
Ose; f(x)Vilgn) < + U < e/2+e/2 =e, 
i=1 


k’ => K(e/2) = ke; n= Ne, k’). 


11 We note, incidentally, that by Lemma 3 and the first part of the proof of Lemma 
2, it follows that a bounded function which has at most singularities of the first kind is 
integrable with respect to every function of V with which it has no common singularities. 
2 This proof, which is different from that given in [3] for Theorem B, was found 
by the author in the winter of 1939, when most of the results given in this paper were 
developed. 


Tk 
t=1 
| 
1K’. 
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Combining this with (*), and remembering that e was arbitrarily as- 
signed, we finally conclude that lim,... J,=lim; lim, s..=ffdg, that 
is, that we have (3). 


Proof of Theorem 4. Let us suppose first that the limit function 
v(x) occurring in (9) is unique. Then if gz,=,— gn, represent the Jor- 
dan decompositions of the functions g,, we find: 


2pn(x) + g(x) + g(a) = p*(x), 2gn—> v(x) — g(x) + g(a) = g*(x). 


Since by (6) and (9), the integrals of f with respect to p* and g* exist, 
we can now proceed exactly as in the first proof of Theorem 3. 

In the general case, let {Jn} be an arbitrary convergent subse- 
quence of {J,}={ffdg.}. By Helly’s theorem of choice, in view 
of (1) and (2), there exists a convergent subsequence { Vnr(x)} of the 
sequence { on-(x) } = { Vi(g,") } , and it follows by the preceding discus- 
sion that J,,-—/fdg. By a familiar reasoning it then follows that we 
also have the relation J,—/fdg, that is, (3). 
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HEAT CONDUCTION IN AN INFINITE COMPOSITE SOLID! 
W. A. MERSMAN 


1. Introduction. The problem of one-dimensional heat conduction 
in finite or semi-infinite composite solids has been investigated exten- 
sively. The doubly-infinite case, however, seems to have been treated 
only for special initial temperature distribution functions.’ 

The purpose of the present paper is to treat the general case. The 
Laplace transformation is used formally in §2 to discover the solution, 
which is then rigorously established in §3. In §4, finally, a uniqueness 
theorem is proved, under more restrictive conditions on the initial 
distribution function. 

Lebesgue integrals are used throughout. 


2. The formal solution. Consider two plane-boundary semi-infinite 
homogeneous solids, composed of different materials, placed in per- 
fect thermal contact. If the conduction of heat takes place in only 
one dimension, perpendicular to the interface, the temperature 
U(x, t) satisfies the following differential system: 


0?U 
(1) — = 4,—_;; =2; 
ot Ox? 
(2) lim U(x,t) = f(x); x #0; 
(3) lim U(x,#) = lim U(x, 4); $>0; 
z—+0 
aU aU 
(4) lim ki — = lim kz —; $> 0; 


z——0 Ox z—+0 Ox 


where x is the perpendicular distance from the interface, ¢ is time, 
a, and k, are the thermal diffusivities and conductivities, respectively, 
of the two materials and are positive constants, and f(x) is a known 
function, defined for all real x except x =0, whose properties will be 
specified later. 

Denoting the common limit in equation (3) by ®(¢), the solution 
can be written in the well known form? 


1 Presented to the Society, April 5, 1941. 

2 Cf. Riemann-Weber, Die partiellen Differential-Gleichungen der mathematischen 
Physik, 5th edition, 1912, vol. 2, p. 98. 

3 Cf. H. S. Carslaw, The Mathematical Theory of the Conduction of Heat in Solids, 
2d edition, London, 1921, §§18, 23. 
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(5)* U(x, 1) = Vil!) + [ - Ja 


1 
2(ma, {ex | <4 | 


and V2(x, t) is obtained from V;(x, #) on replacing a; by a2 and in- 
tegrating from 0 to +. 

Now apply the Laplace transformation to equations (4) and (5), de- 
noting the transforms of U, V, and so on, by x, v, and so on, respec- 
tively: 


where 


Ou(x, s) s) 
(4’) lim ky = lim ke 
Ox Ox 


(5’)5 u(x, s) = $(s) exp [— | (s/a,)/2] + 0,(x, s). 
The unknown function ¢(s) can be eliminated between equations 


(4’) and (5’), and the solution is then obtained by applying the in- 
verse Laplace transformation to u(x, s): 


ki 0 (x + 
d 
U(x, t) Vilx, t) + Aax(xt)'/2 exp | t 
> 0: 
(6) ke (x = §)? 
d 
U(x, t) V2(x, t) + J | 4aot ] 
ky (xa, — = 
* Aa,(xt)*/? J | 4a, 
6, 


where A = (k;(a2)"/? + e(a1)?/?) /(ayae) 1/2. 


Throughout, x<0 when »=1, x>0 when »v=2. 

® The first term in the right member is obtained by the “Faltung” rule. For it and 
the specific transformations used, cf. G. Doetsch, Theorie und Anwendung der Laplace 
Transformation, Berlin, 1937, particularly the table of transformations in Appendix 2. 
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3. The solution established. We have the following theorem. 


THEOREM 1. If f(x) is Lebesgue integrable over any finite interval, and 
if there exist two positive numbers B, b, such that |f(x)| <B exp |bx|, 
then the function U(x, t) defined by equations (6) is a solution of the 
boundary value problem (1), (3), (4). The initial condition (2) is satis- 
fied uniformly in x in any finite interval 0<x,S | x| <x2 in which f(x) 
ts continuous. 


In the classical proofs of similar theorems® f(x) is always a bounded 
function. In adapting such methods of proof to the present situation 
it is only necessary to investigate the effect of the unboundedness of 
f(x). A complete proof is given here for only one typical integral in 
each case. The right members of (6) obviously converge. 

(a) The differential equation. If U(x, t) is differentiated with respect 
to ¢ under the integral sign, one of the resulting integrals is of the form 


2 
J(x,i) = pen f (x + exp | - (2 Ja 


We prove that, if x >0, 0<t;St<te, J(x, t) converges uniformly in ¢. 
From the hypotheses on f(x), 


(x + ale. 


| I(x, f (x + exp | - 
0 


Since the right member converges, and does not contain ¢, J(x, t) 
converges uniformly in ¢. 

(b) The initial condition. The solutions (6) are composed of two 
essentially distinct types of integrals, according as x and £ have the 
same or opposite signs. We first prove that, in the first case, the in- 
tegral vanishes with ¢. 

Consider 


J(x,t) = 


exp | x> 0. 


2(xt) 1/2 4t 


Make the change of variable £= —x+2{t'/?, then: 
J(x,t) = f(— x + 


From the hypotheses on f(x), 


® Cf. Carslaw, loc. cit., p. 31; Goursat, Cours d’ Analyse Mathématique, 4th edition, 
Paris, 1927, vol. 3, chap. 29. 


1 
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| W(x, )| S f exp [— ¢(¢ — 2641/2) 


2/2t1/2 


If x2x1>0, and t<x;/8b, then 
4B 
| I(x, t) | <— (t/m)” exp [— bx: — 


Hence J(x, ¢) approaches zero with ¢, uniformly in x in any semi- 
infinite interval 0 <x <x. 

An inspection of (6) now shows that, in order to prove that the 
initial condition (2) is satisfied it is sufficient to prove that 


lim [Ji(x, 4) — f(x)] = 0 


where 


x, =—— exp | — 4 x > 0, 

the approach to the limit being uniform in x in any interval 
0<xi1Sx54x. Given any e>0, we can choose a 6>0, independent 
of x and #, such that |f(£) —f(x)| <e if |x—é| <6, <a. Having 
chosen 6, we can write 


(7) | Ju(x, ) — f(x) | <f +f +f 4 | f(x) | "dt, 


gil? z/2ti/2 


where in each of the first three integrals the integrand is 


— f(x) | ex | - 
2(mt)1/2 4t 

Denote the right member of (7) by i +/2+/3;+J4. The unbounded- 

ness of f(x) is important in hh, J3, and J4. Iz is easily seen to be less 

than e. In J; make the change of variable £=x—2¢t'/*. Then 


8/2t1/2 


which obviously approaches zero with #, uniformly in x. 
From the hypotheses on f(x), 


B (x — &)? 
3 < >, d. 


| 
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Make the change of variable £=x+8t{/56; if t< 6/86, 


8B 


62/8t 


which obviously approaches zero with ¢, uniformly in x. 

(c) The interface conditions. Equations (3) and (4) will obviously 
be satisfied if x can approach zero within the integral signs. The proof 
is given for one typical case: 


Hat) = f ep [| - 


If 0<4StSh, x>0, 


— 
| J(x, 4) —J(0,d| | | exp | - Ja 


— 2— 2 


Denote the right member of (8) by J:+J2. In J, make the change of 
variable £=x+2¢t!/?: 


(8) 


2Bx 
of exp [— ¢(¢ — 2bt/?) 


t —2/2t1/2 


If the range of integration is split at {=1+2bt!/?, it is easily seen 
that 


e exp [20(1 + 2bt J}, 


1 


hs 


which approaches zero with x, uniformly in ¢. 
Since | 1—exp y| < | y| exp | y| , we have 


Split the range of integration at £=3+-4dr; if x<1, 


3+4bt, 
I, Batz*!? exp | ik f + 28) exp E 
1 0 1 


+f 1 +28 exp [— &/4n]at 


34+4bt, 


which approaches zero with x, uniformly in t. 
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4. Uniqueness. We first state the following definition. 


DEFINITION. A function U(x, t) defined for all real x and positive t 
will be said to satisfy conditions A tf: 

(Ai) U(x, t) ts a continuous function of x in —~» <x< © for any 
fixed t>0. 

(Az) U(x, t) is a continuous function of t in any finite interval 
0<4,:StSh, the continuity being uniform with respect to x in any finite 
interval x; Sx 

(As) U(x, t) is bounded for all real x, all positive t. 

(Ay) OU (x, t)/Ot exists and satisfies (A) and (Ae). 

(As) OU (x, t)/dx and 0?U(x, t)/Ox? exist and are continuous func- 
tions of xin — ~ <x<0 and 0<x< ~, for any fixed t>0. 

(Ae) |AU(x, t) /dx| <M/t'!!?,t>0, x0; M ts a constant independ- 
ent of t and x. 

(A7) U(x, t)-0 as |x| +0, uniformly in t in any finite interval 
0<tSh. 

(As) U(x, t)--0 as t-0, uniformly in x in any finite interval 
0<xm | x| 


THEOREM 2. The function U(x, t)=0 ts the only function satisfying 
conditions A that is a solution of the boundary value problem (1)—(4), 
with f(x) =0. 


Proor. Consider 


1° 
J(x,t) = — ]?dé, > 0. 
Assume that, for some x,+0, 4,0, | U(x, t1)| =C>0. From condi- 
tion (A;), | U(x, ti)| >C/2 in some 7-neighborhood of x. Hence 
J (x,t) >D, any x> | +n, t:>0, where 


k,C?n ‘ 
if n<0;»=2 if m>0. 
ay 


If M is the bound of condition (Ag), we can choose a fixed x2>|x1| +7 
such that 


| U(x2, t)| < 0<t 


IA 


hy 

(9) 

O0<ish. 

4k 


Having chosen x2, consider J(x2, ¢). From conditions (A;), (Ae), (As), 
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J (x2, t) can be differentiated with respect to ¢ under the integral sign; 
using equation (1), 


OJ (xe, t 0?U (x, t 
ot Ox? 


From condition (As) this may be integrated by parts; using equations 
(3) and (4), 


OJ (x2, t) OU (x, é) dU (x, t) 7]? 
[ | - i 


Ox 
0< 


z=—Z2 


Since the last term is nonpositive, 0//dt is not greater than the sum 
of the first two terms. Hence, from (Ag) and equations (9) 


OJ (x2, 2) D 
< 
at 
From conditions (A3), (As), J(x2, +0) =0. Therefore 


D 
f = D. 
J 


But J(x2, t:)>D. Thus the assumption that U(x, t:) 40 leads to a 
contradiction if x,;+0, t;#0, and by the continuity conditions (A,) 
and (Ag) the theorem is proved. 

We are now in a position to prove a more general uniqueness theo- 
rem. 


0<tsh. 


J (x2, ty) 


DEFINITION. Let f(x) be defined and continuous for all real x except 
x=0.A function U(x, t) will be said to satisfy conditions B with respect 
to f(x) if: 

(Bi) U(x, t) satisfies (A1)—(Ac¢) inclusive. 

(Be) U(x, t)—f(x) as t-0, uniformly in x in any finite interval 
| x| 

(B3) U(x, t)—f(x)—0 as | x| —«, uniformly in t in any finite in- 
terval 0<tSh. 


THEOREM 3. Let f(x) be defined and continuous for all real x except 
x =0. Then the boundary value problem (1)—(4) has at most one solution 
satisfying conditions B with respect to f(x). 


Proor. If there are two solutions, let U(x, t) be their difference. 
Then U(x, t) satisfies conditions A and hence vanishes identically. 
Finally, it remains to determine conditions on f(x) under which the 
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solution U(x, t) defined by equations (6) will satisfy conditions B with 
respect to f(x). Sufficient conditions are given by the following theo- 
rem. 


THEOREM 4. Let f(x) be continuous except at x=0, and bounded for 
all real x, and let the following limits exist: 
fi = lim f(x), = fe = lim f(z). 


Then U(x, t) as defined by equations (6) satisfies conditions B with 
respect to f(x). 


Proor. Conditions (A;)—(As) and (Bz) are easily seen to be satis- 
fied. We give the proof for (As) and (Bs). An inspection of (6) shows 
that it is sufficient to prove (A¢) for integrals of the form 


J (x, t) : f > 0;2.>-0 
exp | — x 

By differentiating inside the integral sign and splitting the range 
of integration at x=€ it is easily found that 

| aJ (x, t) | N 

< 

Ox (at) 1/2 


where N is the upper bound of f(x). 
For (Bs) it is sufficient to prove that 


(10) Ji(x, t) asx—>+ 0, 
(11) J2(x, t) — f(x) asx—>+ 0, 


the approach being uniform in ¢ in 0<¢#<h, where 


2 
Ji(x,t) = pan f f(& exp [ ( Ja >: 0; 
0 


1 = — §)? 
J2(x, t) exp | - x>0,i>0. 


To prove (10) make the substitution §= —x+2¢t!/? in Jj: 
1/2 


| Ji(x, t) | < aw f et dt < exp 
x 4t; 


which clearly approaches zero as x approaches infinity, uniformly in ¢ 
in0<tSh. 
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To prove (11), we have 


1 
8) — f2) = f - 10} ex | - Ja 


4t 
x 


2/2t1/2 


In the first term of the right member, make the change of variable 
&=x+2¢t'/2. Then 


3N 
| Jo(x, t) — f(x) | 
(12) 2/4ti/2 
1 
| a) — fle + | 
—2z/4tl2 


Since f(x) approaches a limit as x, the second term of the right 
member of (12) can be made less than any e>0 by choosing x>X, 
independent of t. The first term of the right member of (12) clearly 
approaches zero as required in order that (11) be true. 

It may be remarked that the conditions imposed on f(x) in Theo- 
rem 4 are merely sufficient, and not necessary. In particular, f(x) need 
not have limits as | x| — o. For example, U(x, t) —x as defined by (6) 
satisfies conditions B if f(x) =x. However, in most physical applica- 
tions f(x) will satisfy the hypotheses of this theorem, since the prob- 
lem considered here is an approximation, for small time values, to the 
finite composite solid problem. 
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